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(54) Title: SELECTIVE DESTRUCTION OF CELLS INFECTED WITH HUMAN IMMUNODEFICIENCY VIRUS 

(57) Abstract: Compositions and methods for selectively killing a cell containing a viral protease are disclosed. The composition is 
a variant of a protein synthesis inactivating toxin wherein a viral protease cleavage site is interposed between the A and B chains. The 
variant of the type U ribosome-inactivating protein is activated by digestion of the viral protease cleavage site by the specific viral 
protease. The activated ribosome-inactivating protein then kills the cell by inactivating cellular ribosomes. A preferred embodiment 
of the invention is specific for human immunodeficiency virus (HTV) and uses ricin as the ribosome-inactivating protein- In another 
preferred embodiment of the invention, the variant of the ribosome-inactivating protein is modified by attachment of one or more 
hydrophobic agents. The hydrophobic agent facilitates entry of the variant of the ribosome-inactivating protein into cells and can 
lead to incorporation of the ribosome-inactivating protein into viral particles. Still another preferred embodiment of the invention 
includes a targeting moiety attached to the variants of the ribosome-inactivating protein to target the agent to HIV infec table cells. 
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SELECTIVE DESTRUCTION OF CELLS INFECTED WITH HUMAN 

IMMUNODEFICIENCY VIRUS 

CONTRACTUAL ORIGIN OF THE INVENTION 

5 This invention was made with United States Government support under 

Contract No. DE-AC07-94ID13223, now Contract No. DE-AC07-99ID13727 
awarded by the United States Department of Energy. The United States Government 
has certain rights in the invention. 

10 RELATED APPLICATION 

This application claims priority from United States provisional application 
S/N 60/182,759 filed February 16, 2000 and is incorporated by reference. 

BACKGROUND OF THE INVENTION 

15 This invention relates to antiviral agents and methods of use thereof. More 

particularly, the invention relates to antiviral agents that specifically destroy cells 
infected by viruses that produce a protease in such infected cells. The antiviral 
agents are activated by the viral protease, thereby specifically targeting the infected 
cells for destruction.Toxins that target cell surface receptors or antigens on tumor 

2 0 cells have attracted considerable attention for treatment of cancer. E.g., I. Pastan & 
D. FitzGerald, Recombinant Toxins for Cancer Treatment, 254 Science 1173-1177 
(1991); Anderson et al., U.S. Patent Nos. 5,169,933 and 5,135,736; Thorpe et al., 
U.S. Patent No. 5,165,923; Jansen et al., U.S. Patent No. 4,906,469; Frankel, U.S. 
Patent No. 4,962,188; Uhr et al., U.S. Patent No. 4,792,447; Masuho et al., U.S. 

25 Patent Nos. 4,450,154 and 4,350,626. These agents include a cell-targeting moiety, 
such as an antigen-binding protein or a growth factor, linked to a plant or bacterial 
toxin. They kill cells by mechanisms different from conventional chemotherapy, 
thus potentially reducing or eliminating cross resistance to conventional 
chemotherapeutic agents. 
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Ricin and other similar plant toxins, such as abrin, modeccin and viscumin, 
comprise two polypeptide chains (known as the A and B chains) linked by a disulfide 
bridge, one chain (the A chain) being primarily responsible for the cytotoxicity and 
the other chain (the B chain) having sites that enable the molecule to bind to cell 
5 surfaces. Such toxins are known as type II ribosome-inactivating proteins or RIPs. 
F. Stirpe et al., Ribosome-inactivating Proteins from Plants: Present Status and 
Future Prospects, 'lO Biotechnology 405-412 (1992). 

Ricin is produced in the plant Ricinus communis (commonly known as the 
castor bean plant) via a precursor protein known as "preproricin." Preproricin 

10 comprises a single polypeptide chain that includes a leader sequence, the A chain, a 
linker peptide, and the B chain. The leader sequence is subsequently removed in the 
organism to yield proricin, which is then cleaved to eliminate the linker region such 
that the A and B chains remain connected only by a disulfide bond in the mature 
protein. The toxicity of ricintype toxins operates in three phases: (1) binding to the 

15 cell surface via the B chain; (2) penetration of at least the A chain into the cytosol via 
intracellular organelles, and (3) inhibition of protein synthesis through the A chain 
cleaving an essential adenine residue from ribosomal RNA. Thus, outside the cell 
separated A and B chains are essentially nontoxic, because the inherently toxic A 
chain lacks the ability to bind to cell surfaces and enter the cells in the absence of the 

2 0 B chain. Moreover, preproricin and proricin are also non-toxic, since the activity of 
the A chain is inhibited in these precursors.lt is also known that in ricintype toxins 
the B chain binds to cell surfaces by virtue of galactose recognition sites, which react 
with glycoproteins or glycolipids exposed on the cell surface. It has been suggested 
that the toxicity of the ricin A chain might be exploited in antitumor therapy by 

2 5 replacing the indiscriminatelybinding B chain with a different targeting component 
having the ability to bind only to tumor cells. Thus, various immunotoxins have 
been prepared consisting of a conjugate of whole ricin or a separated ricin A chain 
and a tumorspecific monoclonal antibody or other targeting component.While 
previously described immunotoxins comprising ricin are generally suitable for their 
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specific purposes, they possess certain inherent limitations that detract from their 
overall utility in treating viral infections. One problem with the known conjugates 
arises from a structural feature of the A chain from natural ricin. It is known that the 
natural ricin A chain becomes Nglycosylated during its synthesis, by enzymes 
5 present in Ricinus cells, and it is thought that the resulting sugar moieties are capable 
of nonspecific interactions with cell surfaces. Thus, it appears that the known A 
chain conjugates are capable of a certain amount of binding with non target cells, 
even in the absence of the natural B chain, thus increasing the toxicity of such 
immunotoxins toward non-target cells. To partially mitigate this problem, 

10 recombinant A chain that lacks carbohydrate residues has been produced in E. coli. 
S.H. Pincus & V.V. Tolstikov, Anti-Human Immunodeficiency Virus 
Immunoconjugates, 32 Adv. Pharmacol. 205-247 (1995). Another problem with 
many ricin immunoconjugates arises from the fact that the B chain seems to have an 
important secondary function in that it somehow assists in the intoxication process, 

15 apart from its primary function in binding the ricin molecule to the cell surfaces. 
This secondary function is lost if the B chain is replaced by a different targeting 
component, such as a monoclonal antibody. Some researchers have addressed this 
problem by covalent attachment of affinity reagents to the B chain such that the 
galactose binding sites are blocked. J.M. Lambert et al., An Immunotoxin Prepared 

2 0 with Blocked Ricin: a Natural Plant Toxin Adapted for Therapeutic Use, 5 1 Cancer 
Res. 6236-6242(1991). 

The aforementioned modifications of ricin seek to enhance binding 
specificity to the outer cell surface by immunotoxins and similar, targeted therapeutic 
agents. Since certain types of infected cells do not express infection-related surface 

25 antigens, such binding specificity represents an inherent limitation. S.H. Pincus & 
V.V. Tolstikov, supra, A targeting-independent agent with a well-defined toxin 
activation mechanism involving a viral protease would permit the use of nonspecific 
"targeting" (i.e., cell-binding) molecules, including sugar moieties and fully active 
ricin B chain. Therapeutic agents designed in this manner could eliminate a broader 
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spectrum of infected c ;lls, with potentially fewer undesirable side eflfects.Anti-HIV 
immunotoxins have bten described that include antibodies linked to various toxic 
moieties via a peptide linker that includes a sequence cleavable by HIV protease. 
S.H. Pincus & V.V. Tolstikov, supra. In some cases, release of the toxic moiety by 
5 this protease may render it active, although the specific activation mechanism was 
not further defined. In the present invention, antibodies or segments thereof are only 
one of many potential targeting molecules for the therapeutic agents. Moreover, the 
activation mechanisms are clearly specified in the present invention. One such 
mechanism relies on protease-dependent cleavage at or near the natural protease 

10 activation site for a given toxin, not merely on release from a bulky "carrier" protein 
(i.e., antibody). S.H. Pincus & V.V. Tolstikov, supra. In the case of ricin, the 
natural site for cleavage by proteolytic activity in Ricimts is in a disulfide- 
circumscribed loop in which one cysteine resides on the A chain and the other 
resides on the B chain; cleavage yields A and B chains connected by a disulfide 

15 bond. Therefore, most embodiments of the present invention that involve ricin 

include an HIV-protease cleavage sequence fused in- frame to the C-terminus of A 
chain such that the natural cleavage site is replaced with the HIV-protease site in the 
disulfide-circumscribed loop. In these embodiments, at lease some minimal N- 
terminal sequence of B chain required to inhibit A chain activity is retained, such that 

2 0 activation requires proteolytic cleavage and reduction of the disulfide bond. In all 

remaining embodiments, the mechanism of activation involves cleavage of a peptide 
linker to A chain, thereby separating adenine-like moieties that are chemically 
attached to the linker. Separation of the adenine-like residues unblocks the active 
site of ricin and allows A chain activity. Further, the foregoing text describes 

2 5 preferred embodiments (i.e., full B chain functionality, sugar moieties) that are 
highly compatible with these activation mechanisms. Indeed, these preferred 
embodiments are not suggested by others. S.H. Pincus & V.V. Tolstikov, 
supra. Certain embodiments of the present invention comprise attachment of 
hydrophobic moieties for intracellular targeting to sites of viral protease activity, 
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which is limited in the cytosol. The aforementioned immunotoxins do not possess 
this aspect of the invention. While attachment of hydrophobic fatty acids to ricin A 
chain has been presented in terms of enhancing translocation across cell membranes 
for hypothetical medical applications, a method for activating ricin was not 
5 presented. A.V. Kabanov et a!., Fatty Acylation of Proteins for Translocation Across 
Cell Membrane, 1 Biomed. Sci. 33-36 (1990); V.Y. Alakhov et ah, Increasing 
Cytostatic Effects *of Ricin A Chain and Staphylococcus aureus Enterotoxin A 
Through In Vitro Hydrophobization with Fatty Acid Residues, 12 Biotechnol. Appl. 
Biochem. 94-98 (1990). Hydrophobization of ricin is likely to increase toxicity to 

10 non-target cells, even if cell-surface targeting moieties are attached. A separate, 

viral-protease-dependent mechanism for activating ricin (and similar toxins) would 
prevent nonspecific toxicity. The present invention combines such a mechanism 
with hydrophobization. 

In view of the foregoing, it will be appreciated that providing an antiviral 

15 agent that is activated only in cells infected with a selected virus, is non-toxic in 

uninfected cells, and is targeted independently of infection-related antigens, would 
be a significant advancement in the art. Furthermore, the prior art teaches away from 
making the present invention because specific embodiments described herein have 
previously been described as deleterious (B chain activity). Moreover, the prior art 

2 0 fails to describe or suggest elements of the present invention (e.g., means for fatty 
acid attachment) in combination with a protease-dependent toxin activation 
mechanism. 

BRIEF SUMMARY OF THE INVENTION 

25 It is an object of the present invention to provide an antiviral agent that is 

toxic to virus-infected cells, but non-toxic to uninfected cells. 

It is also an object of the invention to provide an antiviral agent for treating 
viral infections wherein the virus encodes a protease that is essential to virus 
replication. 
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It is another object of the invention to provide a toxin (e.g., ricin) based 
antiviral agent that maintains the inhibitory functionality of the lectin B chain, 
whereby the lectin B chain (or a portion thereof) inhibits the activity of the ricin A 
chain prior to proteolytic cleavage of a linker sequence and reduction of the disulfide 
5 bond.lt is another object of the invention to provide a ricin- (or similar toxin) based 
antiviral agent that maintains the galactose-binding functionality of the ricin B chain, 
which enhances the binding of the antiviral agent to galactose residues on cell 
surfaces and the cellular internalization of the antiviral agent. 

It is still another object of the invention to provide an antiviral agent for 
10 treating retroviral infections, including HIV infections. 

It is yet another object of the invention to provide a method for treating 
retroviral infections wherein the virus encodes a protease that is essential to virus 
replication. 

It is also an object of the invention to provide a method for treating HIV 

15 infections. 

These and other objects can be addressed by providing a composition 
comprising a compound represented by the formula (T m -AXB)H n or (A-X-B-T m )-H n , 
wherein A is a protein synthesis inactivating toxin that is inactive until X is digested; 
X is a peptide susceptible to digestion by a viral protease; B is a lectin, T is a 

2 0 targeting moiety, H is a hydrophobic agent, m is 0 or an integer of at least 1 , and n is 
0 or an integer of at least 1 .In a preferred embodiment of the invention, A is a ricin A 
chain and B is a ricin B chain or segment thereof. In another preferred embodiment 
of the invention, X is susceptible to digestion by a retroviral protease, such as a 
human immunodeficiency virus protease. In an especially preferred embodiment, X 

25 is a member selected from the group consisting of SEQ ID NO: 12 and SEQ ID NO: 
13. Preferably, the targeting moiety is a member selected from the group consisting 
of antigen-binding proteins, viral surface components and segments thereof, growth 
factors, lectins, and carbohydrates. Especially preferred targeting moieties include a 
member selected from the group consisting of antigen-binding proteins, viral surface 
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components and segments thereof, proteins that bind viral surface components, 
growth factors, lectins, and carbohydrates. For example, such targeting moieties can 
include a member selected from the group consisting of antibodies against HIV 
glycoprotein gpl20, antibodies against gp41 , and the CD4 protein or segments 
5 thereof. As a further example, the targeting moiety can be an antigen-binding protein 
that binds the CD4 glycoprotein, such as gpl20 or a segment thereof. Still another 
illustrative targeting moiety is a GAG protein segment. 

The hydrophobic agent is preferably a member selected from the group 
consisting of bile acids, sterols, and saturated and unsaturated fatty acids. Preferred 

10 bile acids include cholic acid, deoxycholic acid, chenodeoxycholic acid, lithocholic 
acid, ursocholic acid, ursodeoxycholic acid, isoursodeoxycholic acid, 
lagodeoxycholic acid, glycocholic acid, taurocholic acid, glycodeoxycholic acid, 
glycochenodeoxycholic acid, dehydrocholic acid, hyocholic acid, hyodeoxycholic 
acid, and mixtures thereof. Preferred sterols include cholestanol, coprostanol, 

15 cholesterol, epicholesterol, ergosterol, ergocalciferol, and mixtures thereof. 

Preferred saturated or unsaturated fatty acid comprise about 4 to 20 carbon atoms,, 
such as butyric acid, valeric acid, caproic acid, caprylic acid, capric acid, lauric acid, 
myristic acid, palmitic acid, stearic acid, oleic acid, linoleic acid, linolenic acid, 
eleostearic acid, and mixtures thereof. 

2 o Another preferred embodiment of the invention further comprises a 

pharmaceutical^ acceptable carrier admixed with the compound. 

Still another preferred embodiment of the invention comprises a composition 
comprising a compound represented by the formula N-X-A or A-X-N, wherein A is a 
protein synthesis inactivating toxin that is inactive until digestion of X, X is a peptide 

25 susceptible to digestion by a viral protease, and N is an adenine moiety or a 

functional equivalent thereof.Yet another preferred embodiment of the invention 
comprises a method for treating a human immunodeficiency virus infection 
comprising administering an effective amount of a composition comprising: 
(a) a member selected from the group consisting of: 
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(i) 2 compound represented by the formula (T m -AXB)H n or 
(AXBT m )H m wlierein A is a protein synthesis inactivating toxin that is 
inactive until X is digested; X is a peptide susceptible to digestion by a 
human immunodeficiency virus protease; B is a lectin, T is a targeting 

5 moiety, H is a hydrophobic agent, m is 0 or an integer of at least 1, and n is 0 

or an integer of at least 1, 

(ii) 4 a compound represented by the formula N-X-A or A-X-N, 
wherein A is a protein synthesis inactivating toxin that is inactive until 
digestion of X, X is a peptide susceptible to digestion by a human 

10 immunodeficiency virus protease, and N is an adenine moiety or functional 

equivalent thereof, and 

(iii) mixtures of (i) and (ii); and 
(b) a pharmaceutically acceptable carrier. 

A further preferred embodiment of the invention comprises a nucleic acid 
15 encoding a peptide represented by the formula A-X-B wherein A is a protein 
synthesis inactivating toxin that is inactive until digestion of X; X is a peptide 
susceptible to digestion by a viral protease; and B is a lectin or other targeting 
moiety. 

20 DETAILED DESCRIPTION OF THE INVENTION 

Before the present antiviral agents and methods of use thereof for treating 
viral infections are disclosed and described, it is to be understood that this invention 
is not limited to the particular configurations, process steps, and materials disclosed 
herein as such configurations, process steps, and materials may vary somewhat. It is 
25 also to be understood that the terminology employed herein is used for the purpose of 
describing particular embodiments only and is not intended to be limiting since the 
scope of the present invention will be limited only by the appended claims and 
equivalents thereof. 
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The publications and other reference materials referred to herein to describe 

the background of the invention and to provide additional detail regarding its practice 

are hereby incorporated by reference. The references discussed herein are provided 

solely for their disclosure prior to the filing date of the present application. Nothing 

5 herein is to be construed as an admission that the inventors are not entitled to 

antedate such disclosure by virtue of prior invention, 
i 

It must be noted that, as used in this specification and the appended claims, 
the singular forms "a," "an," and "the" include plural referents unless the context 
. clearly dictates otherwise. Thus, for example, reference to a composition comprising 
10 "a carrier" includes reference to two or more of such carriers, reference to "a fatty 
acid" includes reference to one or more of such fatty acids, and reference to "a 
targeting moiety" includes reference to two or more of such targeting moieties. 

In describing and claiming the present invention, the following terminology 
will be used in accordance with the definitions set out below.As used herein, 
1 5 "comprising," "including," "containing," "characterized by," and grammatical 

equivalents thereof are inclusive or open-ended term's that do not exclude additional, 
unrecited elements or method steps. "Comprising" is to be interpreted as including 
the more restrictive terms "consisting of and "consisting essentially of." 

As used herein, "consisting of" and grammatical equivalents thereof exclude 
2 0 . any element, step, or ingredient not specified in the claim. 

As used herein, "consisting essentially of* and grammatical equivalents 
thereof limit the scope of a claim to the specified materials or steps and those that do 
not materially affect the basic and novel characteristic or characteristics of the 
claimed invention. 

2 5 As used herein, "protein synthesis inactivating toxin" includes toxins that are 

ribonucleases, N-glycosidases, or ADP-ribosyltransferases. N-glycosidases are 
exemplified by the single polypeptide of the plant type I ribosome inactivating 
proteins (e.g., gelonin, momordin, and saporin), and the "A" chain of the plant type 
II ribosome-inactivating proteins (e.g., ricin, abrin, modeccin, and the like) and 
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similar acting bacterial toxins. "Protein synthesis inactivating toxin" also includes 
the specific ribonucleases that digest a specific phosphodiester bond in the backbone 
of ribosomal RNA, thereby inactivating the ribosomes and inhibiting protein 
synthesis. Ribonucleases are exemplified by the fungal toxins alpha-sarcin, 
5 mitogillin, and restrictocin, but also include similar acting bacterial toxins. "Protein 
synthesis inactivating toxin" also includes the ADP-ribosylating component of the 
ADP-ribosyltransferases, which are proteolytically activated bacterial toxins that 
ADP-ribosylate, and thus inactivate, components of the protein synthesis machinery 
(e.g., diphtheria toxin, Pseudomonas exotoxin A).Plant ribosome-inactivating 

10 proteins (RIPs) are N-glycosidases that cleave the N-glycosidic bond of adenine in a 
specific ribosomal RNA sequence. Many RIPs are single-chain proteins (type I 
RIPs), but some (type II RIPs) possess a galactose-specific lectin domain that binds 
to cell surfaces. J. M. Ford, M. R. Hartley, L. M. Roberts, Ribosome Inactivating 
Proteins of Plants, 2 Seminars in Cell Biology 15-22 (1991). The type II RIPs are 

15 potent toxins, the best known of which is ricin. As used herein, u type II ribosome- 
inactivating proteins" or 'type II RIPs" means two-chain N-glycosidases that cleave 
the N-glycosidic bond of adenine in a specific ribosomal RNA sequence, wherein the 
two chains are an A chain, which possesses the N-glycosidase activity, and a B 
chain, which comprises a galactose-specific lectin domain that binds to cell surfaces. 

2 0 Ricin is the prototypical type II ribosome-inactivating protein, but other such type II 
RIPs include abrin (from Abrus precatrius), modeccin (from Adenia digtata), 
viscumin (from Viscum album), and volkensin (from Adenia volkensii). C.H. Hung 
et al., Cloning and Expression of Three Abrin A-chains and Their Mutants Derived 
by Site-specific Mutagenesis in Escherichia coli, 219 Eur. J. Biochem. 83-87 (1994); 

25 K.A. Wood et al., Preproabrin: Genomic Cloning, Characterisation and the 

Expression of the A-chain in Escherichia coli, 198 Eur. J. Biochem. 723-732 (1991); 
C.-H. Hung et al., Primary Structure of Three Distinct Isoabrins Determined by 
cDNA Sequencing: Conservation and Significance, 229 J. Mol. BioL 263-267 
(1993); GenBank accession numbers X76644, X76720, X76721, X76722, X54873, 
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X54872, X55667, A58957; International Application publication no. 
WO9701636.As used herein, "ricin A chain" means an N-glycosidase of about 32 
kDa that digests and inactivates 26S and 28S ribosomal RNA by cleavage of a 
specific adenine residue located within a highly conserved region of the 26S and 28$ 
5 ribosomal RNA, as is well known in the art. As used herein, "ricin B chain" means a 
galactose/N-acetylgalactosamine-binding lectin of about 34 kDa. SEQ ID NO: 1 
shows a DNA encoding preproricin and translation product thereof, wherein the 
signal peptide comprises amino acid residues -24 to -1, the A chain comprises amino 
acid residues 1 to 267, the linker peptide comprises amino acid residues 268 to 279, 

10 and the B chain comprises amino acid residues 280 to 541 . This sequence is 
incomplete at the 5' end (L. M. Roberts, J. W. Tregear, J. M. Lord, Molecular 
Cloning of Ricin, 7 Targeted Diagn Ther 81-97 (1992)) and the entire sequence is 
given in SEQ NO. 2 and SEQ NO. 3, which follows. SEQ ID NO:2 and SEQ ID 
NO: 3 show other DNAs encoding preproricin and their translation products thereof 

15 wherein the signal peptides comprise amino acid residues 35 to -1, the A chains 
comprise amino acid residues 1 to 267, the linker peptides comprise amino acid 
residues 268 to 279, and the B chains comprise amino acid residues 280 to 541 . 
Preferred ricin A chains and ricin B chains include the peptides having the amino 
acid sequences identified as SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, and 

2 0 biologically functional equivalents thereof. These functional equivalents include any 
two-subunit, proteolytically activatable, W-glycosidase toxins that inactivate 
ribosomes, i.e., type II ribosome-inactivating proteins. Such functional equivalents 
of the ricin A chain, for example, retain functionality in digesting ribosomal RNA 
although they may be truncations, deletion variants, or substitution variants of the 

25 ricin A chain portions of SEQ ID NO:l, SEQ ID NO:2, or SEQ ID NO:3 or include 
additional amino acid residues attached thereto. Functional equivalents of the ricin A 
chain may also be derived from any other type I or type II ribosome inactivating 
protein. Functional equivalents of the ricin B chain, for example, (a) retain 
functionality of binding to galactose residues on the surface of cells although they 
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may be truncations, de etion variants, or substitution variants of the ricin B chain 
portions of SEQ ID NO:l , SEQ ID NO:2, or SEQ ID NO:3 or include additional 
amino acid residues attached thereto; (b) retain functionality for inhibiting activity of 
the ricin A chain; (c) or both. Carbohydrate moieties may optionally be attached to 
5 the A or B chains. 

As mentioned above, changes may be made in the structure of the type II 
ribosome-inactivating protein A chain or B chain while maintaining the desirable N- 
glycosidase or galactose-binding characteristics thereof. For example, certain amino 
acid residues may be substituted for other amino acid residues in a protein structure 

10 without appreciable loss of activity. Since it is the interactive capacity and nature of 
a protein that defines that protein's biological functional activity, certain amino acid 
sequence substitutions can be made in a peptide's sequence and nevertheless obtain 
a protein with like properties. It is thus contemplated that various changes may be 
made in the sequence of a type II ribosome-inactivating protein A chain or B chain 

1 5 without appreciable loss of its biological utility or activity. 

It is also well understood by the skilled artisan that inherent in the definition 
of a biologically functional equivalent protein or peptide is the concept that there is a 
limit to the number of changes that may be made within a defined portion of the 
molecule and still result in a molecule with an acceptable level of equivalent 

2 0 biological activity. It is also well understood that where certain residues are shown 
to be particularly important to the biological or structural properties of a protein or 
peptide, e.g. residues in active sites, such residues may not generally be 
exchanged. Amino acid substitutions are generally based on the relative similarity of 
the amino acid side-chains relative to, for example, their hydrophobicity, 

2 5 hydrophilicity, charge, size, and the like. An analysis of the size, shape, and type of 
the amino acid side-chains reveals, for example, that arginine, lysine, and histidine 
are all positively charged residues at neutral pH; that alanine, glycine, and serine are 
all similar in size; and that phenylalanine, tryptophan, and tyrosine all have a 
generally similar shape. Therefore, based upon these considerations, the following 
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conservative substitution groups or biologically functional equivalents have been 
defined:(a) Cys; (b) Phe, Trp, Tyr; (c) Gin, Glu, Asn, Asp; (d) His, Lys, Arg; (e) Ala, 
Gly, Pro, Ser, Thr; and (0 Met, He, Leu, Val. M. Dayhoff et al., Atlas of Protein 
Sequence and Structure (Nat'l Biomed. Res. Found., Washington, D.C., 1978). 
5 Based on similar considerations, another grouping of amino acids is as follows: (a) 
Pro; (b) Ala, Gly; (c) Ser, Thr, (d) Asn, Gin; (e) Asp, Giu; (f) His; (g) Lys, Arg; (h) 
Cys; (i) He, Leu, Met, Val; and (j) Phe, Trp, Tyr. M. Jimenez-Montano & L. 
Zamora-Cortina, Evolutionary Model for the Generation of Amino Acid Sequences 
and its Application to the Study of Mammal Alpha-hemoglobin Chains, Proc. Vllth 

10 Int'l Biophysics Congress, Mexico City (1981). 

To effect more quantitative changes, the hydropathic index of amino acids 
may be considered. Each amino acid has been assigned a hydropathic index on the 
basis of its hydrophobicity and charge characteristics, which are as follows: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine 

15 (+2.5); methionine (+1.9); alanine (+1.8); glycine (0.4); threonine (0.7); serine (0.8); 
tryptophan (0.9); tyrosine (1.3); proline (1.6); histidine (3.2); glutamate (3.5); 
glutamine (3.5); aspartate (3.5); asparagine (3.5); lysine (3.9); and arginine (4.5).The 
importance of the hydropathic amino acid index in conferring interactive biological 
function on a protein is generally understood in the art. J. Kyte & R. Doolittle, A 

20 Simple Method for Displaying the Hydropathic Character of a Protein, 157 J. Mol. 
Biol. 105-132 (1982). It is known that certain amino acids may be substituted for 
other amino acids having a similar hydropathic index or score and still retain a 
similar biological activity. In making changes based on the hydropathic index, the 
substitution of amino acids whose hydropathic indices are within ± 2 is preferred, 

25 within ± 1 is particularly preferred, and within ± 0.5 is even more particularly 
preferred. 

It is also understood that an amino acid can be substituted for another having 
a similar hydrophilicity value and still obtain a biologically equivalent protein. As 
detailed in U.S. Patent No. 4,554,101, the following hydrophilicity values have been 
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assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0); 
giutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 
threonine (0.4); proline (0.5 ± 1); alanine (0.5); histidine (0.5); cysteine (1.0); 
methionine (1.3); valine (1.5); leucine (1.8); isoleucine (1.8); tyrosine (2.3); 
5 phenylalanine (2.5); tryptophan (3.4). 

In making changes based upon similar hydrophilicity values, the substitution 
of amino acids whose hydrophilicity values are within ± 2 is preferred, within ± 1 is 
particularly preferred, and within ± 0.5 is even more particularly preferred. 

As used herein, "viral protease" and similar terms mean proteases coded for 
10 by viral nucleic acids and expressed in virus-infected cells. For example, all known 
human retroviruses encode their own proteases. E.g., R.A. Katz & A.M. Skalka, The 
Retroviral Enzymes, 63 Annu. Rev. Biochem. 133-173 (1994); K. Von der Helm, 
Retroviral Proteases: Structure, Function and Inhibition from a Non-anticipated Viral 
Enzyme to the Target of a Most Promising HIV Therapy, 377 Biol. Chem. 765-774 
15 (1996). Retroviral proteases cleave a polyprotein precursor into a reverse 

transcriptase, capsid proteins, and other processed products. A variety of other 
viruses also encode their own proteases, including herpesviruses, hepatitis C viruses, 
rhinoviruses, and picornaviruses. A.K. Patick & K.E. Potts, Protease Inhibitors as 
Antiviral Agents, 11 Clin. Microbiol. Rev. 614-627 (1998); B.D. Korant, Viral 
2 0 Proteases: An Emerging Therapeutic Target, 8 Crit. Rev. Biotechnol. 149-1 57 
(1988). 

As used herein, "human immunodeficiency virus protease cleavage site" and 
similar terms mean peptides that are digested by HIV protease. HIV-1 and H1V-2 
proteases, although not identical, both cleave a viral GAG precursor protein of HIV-2 
2 5 at two very different sites to yield the same products. J.C. Wu et al., Synthetic HIV- 
2 Protease Cleaves the GAG Precursor of HIV-1 with the Same Specificity as HIV-1 
Protease, 277 Arch. Biochem. Biophys. 306-31 1 (1990). One such site in the GAG 
precursor is Val-Ser-Gln-Asn-Tyr-Pro-lle-Val-Gln-Asn (type 1 cleavage site wherein 
cleavage occurs between the tyrosine and proline residues; SEQ ID NO:4). The 
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second site is Ser-Ala-Thr-Ile-Met-Met-Gln-Arg-Gly-Asn (type 2 cleavage site 
wherein cleavage occurs between the two methionine residues, SEQ ID NO:l 1). 
There are many different sequences cleaved by HIV-1 protease, which generally fall 
into one of two types exemplified by these two sites: type 1 having Tyr(Phe)-Pro- 
5 and type 2 having hydrophobic residues (excluding proline) at the site of cleavage. J. 
T6zser et al, Studies on the Symmetry and Sequence Context Dependence of the 
HIV-1 Proteinase'Specificity, 272 J. Biol. Chem. 16807-16814 (1997). This article 
by Tozser et al. goes on to conclude that classification of retroviral cleavage sites 
into two types is an oversimplification and the strong sequence context dependence 

10 also raises difficulties for predicting cleavage sites. While the variability of 
sequences cleaved by HIV protease makes it difficult to predict cleavage sites, 
models have been successfully generated for the purpose of predicting these sites. 
K.C. Chou, A Vectorized Sequence-coupling Model for Predicting HIV Protease 
Cleavage Sites in Proteins, 268 J. Biol. Chem. 16938-16948 (1993); K.C. Chou, 

15 Prediction of Human Immunodeficiency Virus Protease Cleavage Sites in Proteins, 
233 Anal. Biochem. 1-14 (1996). Proteases from HIV-1 and HIV-2 recognize and 
cleave the same two types of sequences, although not necessarily the same sequences 
with the same cleavage efficiencies. Thus, it is possible to construct a therapeutic 
agent useful for both forms of the virus. HIV-1 is the form prevalent in the Western 

2 0 world, while HIV-2 is typically found in West African patients with AIDS. 
However, HIV-1 is likely to cause AIDS in West African patients as well. 

As used herein, "peptide" means peptides of any length and includes proteins. 
The terms "polypeptide" and "oligopeptide" are used herein without any particular 
intended size limitation, unless a particular size is otherwise stated. 

25 As used herein, "carbohydrate" means carbohydrate monomers, oligomers, 

and polymers. There is no particular intended size limitation with respect to 
carbohydrate oligomers or polymers unless a particular size is otherwise stated. 

As used herein, "lectin" means a class of proteins of nonimmunological 
origin that binds carbohydrates. The scope of lectins that can be used according to 
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the present invention ii limited only by functionality, i.e., binding to carbohydrates 
and/or inhibiting activity of a protein synthesis inactivating toxin to which it is fused. 
Segments or portions of lectins are also to be considered within the scope of the term 
"lectin" provided that such segments or portions retain the carbohydrate-binding 
5 and/or toxin-inhibiting function. Representative lectins that can be used according to 
the present invention include the following: abrin or jequirity bean (Abrus 
precatorius), asparagus pea or lotus or winged pea (Tetragonolobus purpureas)* 
avocado (Persea americana), bitter pear melon (Momordica charantia), broad bean 
(Vicia faba% camels foot tree (Bauhinia purpurea), castor bean (Ricinus communis), 
10 chick pea (Cicer arietinum), Mozambique cobra (Naja mocambique mocambique), 
Thailand cobra {Naja naja laculbia), concanavaiin A or jack bean {Canavalia 
ersiformis), Israel coral tree {Erythina oxaliodendron), daffodil {Narcissus 
pseudonarcissus), eel (Anguilla anguilla), elderberry {Sambucus nigra), furze or 
gorse (Ulex europaeus), green marine algae (Cocleum* fragile), hairy vetch (Vicia 
15 villosa), horse gram (Dolichos bijlorus), horseshoe crab or limulin (Limulus 
polyphemus), jacalin (Artocarpus integrifolid), Japanese wisteria (Wisteria 
floribunda), jimson weed or thorn apple (Datura stramonium), Scotch laburnum 
(Laburnum alpinum), lentil (Lens culinaris), lima bean {Phaseolus limensis), 
European mistletoe or viscumin (Viscum album), mung bean (Vigna radiata), 
2 0 mushroom (Agaricus bisporus), Osage orange (Madura pomifera), pagoda tree 
(Sophora japonica), pea (Pisum sativum), peanut (Arachis hypogaea), pokeweed 
(Phytolacca americana), potato (Solanum tuberosum), red kidney bean (Phaseolus 
vulgaris), red marine algae (Plifola plumosa), Roman snail (Helix porrata), scarlet 
runner bean (Phaseolus coccineus), Scotch broom (Cytisus scoparius), Siberian pea 
2 5 tree (Caragana arborescens), snail-edible (Helix pomatia), snail-garden (Helix 
aspersa), snowdrop (Galanthus nivalis), soybean (Glycine max), spindle tree 
(Euonymus europaeus), sweet pea (Lathyrus odoratus), tomato (Lycopersicon 
esculentum), wheat germ (Triticum vulgaris), winged bean {Psophoramus 
beliagonolobus). Preferred lectins include the B chains of type II RIPS, such as 
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ricin, abrin, modeccin, viscumin, and volkensin. Additional preferred lectins include 
tora-mame lectin (from Phaeolus vulgaris), phytohemagglutinin (PHA), wheat germ 
agglutinin, achatinin H, and Vicia villosa lectin. Still additional preferred lectins 
include lectins that bind to T lymphocytes, including mistletoe lectin, Phaseolus 
5 vulgaris leucoagglutinin, and lectins from Bandeiraea simplicifolia BS-I, Bauhinia 
purpurea. Glycine max y Lycopersicon esculentum, Triiicum vulgaris, Canavalia 
ensiformiSy Lens culinaris, Phaseolus coccineus, and Wisteria floribunda. Further 
additional preferred lectins include lectins that bind to sulfated carbohydrates, such 
as L-selectin and human galectin 1. Still further preferred lectins include lectins that 

10 bind to HIV gpl20, including concanavalin A, wheat germ agglutinin, Lens culinaris 
agglutinin, Vicia faba agglutinin, Pisum sativum agglutinin, and 
phytohaemagglutinin. Another group of preferred lectins include the B subunits of 
Shiga and Shiga-like toxins (SLTs), such as the B chains of Shiga toxin, SLT-I, SLT- 
II, SLT-IIv, and SLT-2e. Still other preferred lectins include jacalin, mannose 

15 binding proteins (e.g., concanavalin A), and maltose binding protein. 

As used herein, a "pharmaceutical^ acceptable" component is one that is 
suitable for use with humans and/or animals without undue adverse side effects (such 
as toxicity, irritation, and allergic response) commensurate with a reasonable 
benefit/risk ratio. Such materials are pharmaceutically acceptable in that they are 

2 0 nontoxic, do not interfere with drug delivery, and are not for any other reasons 
biologically or otherwise undesirable. 

As used herein, "effective amount" means an amount of a drug or 
pharmacologically active agent that is nontoxic but sufficient to provide the desired 
local or systemic effect and performance at a reasonable benefit/risk ratio attending 

2 5 any medical treatment. 

As used herein, "administering" and similar terms mean delivering the 
composition to the individual being treated such that the composition is capable of 
being circulated systemically to the parts of the body where the viral protease- 
sensitive linker can be digested by the viral protease, thus cleaving a peptide bond in 
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a linker that connects the type II ribosome-inactivating protein A chain and the type 
II ribosome-inactivating protein B chain. After such digestion, however, the A chain 
and the B chain will still be coupled to each other by a disulfide bond. Thus, the 
composition is preferably administered to the individual by systemic administration, 
5 typically by subcutaneous, intramuscular, or intravenous administration, or 
intraperitoneal administration. Injectables for such use can be prepared in 
conventional fontf s, either as a liquid solution or suspension or in a solid form 
suitable for preparation as a solution or suspension in a liquid prior to injection, or as 
an emulsion. Suitable excipients or carriers include, for example, water, saline, 

10 dextrose, glycerol, ethanol, and the like; and if desired, minor amounts of auxiliary 
substances such as wetting or emulsifying agents, buffers, and the like can be added. 
Other carriers can be used and are well known in the art. 

In its most basic embodiment, the present invention comprises a composition 
and method of use thereof, wherein the composition comprises a type II ribosome- 

15 inactivating protein precursor protein, such as proricin, wherein the natural linker is 
replaced with a viral-protease-cleavable linker. In an especially preferred 
embodiment of the invention, the natural linker is replaced with an HlV-protease- 
cieavable linker. Generally, there are two types of sequences cleaved by HIV 
proteases, which can be identified with some predictability. In certain embodiments, 

20 a major advantage of, for example, a proricin variant over typical immunotoxins is 
that cells in early stages of infection can be killed, thus preventing HIV replication. 

Targeting molecules can be added to various sites on the proricin-based agent 
in addition to or for replacing the targeting activity of the B chain. Targeting 
molecules include, but are not limited to, CD4 and derivatives thereof; antibodies 

25 such as anti-gpl20, anti-gp41, and the like; IL-2 segments; gpI20 segments; gag 
segments; fatty acid or other hydrophobic substituents; and mannose-containing 
carbohydrate moieties. Especially preferred targeting molecules include gpl20 
segments and fatty acid or other hydrophobic substituents. Fatty acid groups may 
permit direct translocation of the agent into cells, rendering the activity of the B 
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chain unnecessary, even though the presence of the B chain or a segment thereof 
would remain important so that the A chain would be inactive until proteolytically 
activated by HTV protease. An alternative mechanism for proteolytic activation 
involves an adenine or similar group attached to an HIV-protease-cleavable 
5 extension linker on the A chain, which will inhibit A chain activity until cleaved by 
HIV protease. 

A major advantage of the present invention is that the antiviral agent is 
activated in viral particles or early-stage infected cells, killing the cells upon 
infection and effectively preventing the integration of the viral genome into the host 

10 genome and preventing the latency/rebound problem. Another major advantage in 
certain embodiments is that the present invention should be able to enter all HIV 
susceptible cells, not just cells known to act as host cells for the virus. Moreover, the 
agent remains inert in a cell until degraded therein, unless the cell is infected with the 
virus, wherein the viral protease activates the agent. 

15 Human immunokeficiency virus (HIV) is representative of viruses that 

encode their own protease enzymes that cleave specific sites on viral proteins during 
the replication process. J. Gatlin et al., Regulation of Intracellular Human 
Immunodeficiency Virus Type-1 Protease Activity, 244 Virology 87-96 (1998). 
Examples of other protease-encoding viruses are yellow fever virus and tick-bome 

2 0 encephalitis virus. T.J. Chambers et al., Evidence that the N-terminal Domain of 
Nonstructural Protein NS3 from Yellow Fever Virus is a Serine Protease 
Responsible for Site-specific Cleavages in the Viral Polyprotein, 87 Proc. NatM 
Acad. Sci. USA 8898-8902 (1990); K.V. Pugachev et al., Site-directed Mutagenesis 
of the Tick-borne Encephalitis Virus NS3 Gene Reveals the Putative Serine Protease 

25 Domain of the NS3 Protein, 328 FEBS Lett. 115-118 (1993). Vaccinia virus, which 
is so closely related to the smallpox virus that it was used as the basis for a vaccine 
against smallpox, apparently uses a protease specifically located within infected 
cells, R.M.L. Buller et al., Poxvirus Pathogenesis, 55 Microbiol. Rev. 80-122 (1991). 
The smallpox virus may use a similar protease. 
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The therapeutic agent described herein is based on type II ribosome- 
inactivating proteins, such as the protein toxin, ricin, and is generally applicable to 
viruses that encode site-specific proteases. From this point in the application, ricin 
will be used as a preferred example of a type II ribosome-inactivating protein, but it 
5 is intended that the scope of the invention include any protein synthesis inactivating 
toxin, as previously defined. As briefly reviewed above, ricin from castor beans 
comprises two su6units (A and B) connected by a disulfide bond, with carbohydrate 
chain(s) attached to each. J.M. Lord et al., Ricin: Structure, Mode of Action, and 
Some Current Applications, 8 FASEB J. 201-208 (1994); U.S. Patent No. 5,622,838. 
10 The A subunit is an enzyme that cleaves a critical adenine residue from ribosomal 
RNA, thereby inactivating ribosomes. The B subunit is a lectin that binds to cell- 
surface galactose residues and also contributes to cellular internalization of the A-B 
toxin. 

The rationale for using ricin-based therapeutic agents against HTV has been 
15 documented. M.G.C.T. Van Oijen et al., Rationale for Use of Immunotoxins in the 
treatment of HIV-infected Humans, 5 J. Drug Target 75-91 (1997). Viruses 
appropriate the metabolic machinery of infected host cells to replicate themselves. 
By inactivating the machinery for protein synthesis, i.e., ribosomes, and thereby 
killing infected cells, selective ricin-based agents can prevent the replication of HIV. 
2 0 This strategy relies on attaching targeting molecules that bind to infected cells. 

There are two major drawbacks to this approach. First, the viral antigens to which 
the targeting molecules bind may either be shed from the cell surfaces, J.M. Lord et 
al., supra, or occur on free viral particles, not just cell surfaces. Thus the intended 
targets (infected cells) are not efficiently destroyed. Second, the ricin-plus-targeting 
2 5 molecule conjugates must be administered in large doses, presumably because they 
are not ideally suited to the complex internalization pathway whereby ricin enters 
cells. B.M. Simmons et al., Mannose Receptor-mediated Uptake of Ricin Toxin and 
Ricin A Chain by Macrophages, 261 J. Biol. Chem. 7912-7920 (1986). 



WO 01/60393 



PCT/US01/05282 



21 

The present approach involves a modification of native proricin, which is the 
precursor of ricin that differs only in that the A and B subunits are connected by a 
peptide linker to form a continuous proricin peptide. Cleavage of proricin by a 
protease in castor beans (the source of ricin) yields the mature ricin protein. In the 
5 present invention, the naturally occurring peptide linker between the ricin A chain 
and the ricin B chain is replaced by an HIV-protease-cleavable site. Research results 
suggest that the A 1 and B subunits are not normally separated until ricin reaches the 
cytoplasm, wherein the connecting disulfide bond is cleaved, J.P. Frenoy et al., 
Uptake of Injected 125 I-ricin by Rat Liver In Vivo, 284 Biochem. J. 249-257 (1992); 

10 hence, a modified peptide linker connecting the A and B subunits should not 

interfere with cellular internalization of the protein variant. Indeed, internalization of 
this variant should be significantly more efficient than for immunotoxins that include 
targeting molecules.^ 

These data strongly suggest that incorporating an HIV-protease-sensitive 

15 linker into proricin will produce an agent that selectively kills HIV-infected cells. 
Moreover, the proricin variant should work against both forms of the virus, HIV-1 

. \ : .... 

and HIV-2, because it has been shown that HIV-2 protease cleaves the polyprotein 
precursor of HIV-1 with the same specificity as the HIV-1 protease. J.C. Wu et al., 
supra. 

20 The proricin variant will bind to any cell that displays galactose residues on 

the cell surface. Another route for cellular uptake involves the mannose-containing 
carbohydrate chains attached to ricin as it is produced in castor beans and as it should 
be produced in a yeast cell host. These mannose groups are bound by mannose 
receptors on cell surfaces. B.M. Simmons et al., supra. Additional routes for 

2 5 cellular uptake could be engineered by attaching other sugar-binding lectins to the N- 
terminus of the ricin A chain or the C-terminus of the B chain. Other proteins have 
been attached to the N-terminus of the A chain without affecting ricin activity or 
internalization. B. Beaumelle et al., Ricin A Chain Can Transport Unfolded 
Dihydrofolate Reductase into the Cytosol, 272 J. Biol. Chem. 22097-22102 (1997). 
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Together, the many possible proricin variants will be able to enter virtually any cell 
type, but will be activated within and kill only virus-infected cells. Although the 
preferred embodiment of the present invention, described above, is designed to be 
activated in cells infected with HIV, it is also possible that other proteases present in 
5 the healthy human body might activate the proricin-based therapeutic agent. 

Therefore, it would be advantageous to increase the selectivity of the therapeutic 
agent still further. 1 One solution for increasing selectivity would be to insert a linker 
containing a type 2 HIV-protease-cleavable site into proricin instead of a type 1 
cleavable site. A protease found naturally in the human body may cleave one of the 

10 linker types, and activate the therapeutic agent. By changing the linker sequence, 
selectivity for HIV protease may be increased by avoiding linker cleavage by 
naturally occurring proteases. 

A second solution to avoiding activation by nonspecific proteases is to insert 
amino acid residues into natural, exposed loops of A chain that are also found in the 

15 cleavable linker; such insertions cannot significantly reduce A chain activity. Also, 
these insertions cannot contain sequences cleavable by the viral protease. Therefore, 
if a host (i.e., human) protease can cleave the linker at a sequence also found in an A- 
chain loop insertion, the host protease will inactivate the ricin A chain and prevent 
destruction of uninfected cells. 

2 0 Another solution to increasing selectivity is to attach a targeting moiety to the 

proricin variant. Five categories of targeting molecules are discussed herein: (1 ) 
Targeting to the surfaces of infected cells displaying viral proteins as the targets, e.g., 
gpl20. Such targeting molecules would include antibodies, viral proteins or portions 
thereof, and other proteins that bind to such viral components (e.g., CD4). Typical 

2 5 immunotoxins use this type of targeting molecule. (2) Targeting to infectable or 
susceptible cells. Molecules used for targeting include proteins that bind to CD4. 
(3) Targeting to virtually any cell. The ricin B chain binds to any cell displaying 
galactose residues. Mannose chains on ricin A and B chains could bind to mannose 
receptors. Fatty acids permit internalization into a variety of cells. Lectins other 
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than the ricin B chain can also be used. (4) Intracellular targeting to the inner surface 
of the cell membrane for incorporation into viral particles. Fatty acids and/or 
segments of the GAG protein attached to the antiviral agent will promote this 
process. Any protein segment to which fatty acids are attached by enzymes in vivo 
5 can be fused to the antiviral agent. (5) Targeting with viral vectors, such as an HIV- 
based vector that would ultimately insert DNA encoding the antiviral agent into 
susceptible cells. t)ne or more of these five different types of targeting strategies can 
be used. 

A preferred targeting molecule for attachment to the modified proricin of the 

1 0 present invention is a molecule that targets the CD4 glycoprotein on cell surfaces or 
an antibody against an HIV envelope glycoprotein (S.H. Pincus, Therapeutic 
Potential of Anti-HIV Immunotoxins, 33 Antiviral Res. 1-9 (1996)). As is well- 
known in the art, attachment of HIV to cells involves the interaction of the HIV 
gpl20 envelope glycoproteins with specific receptors on cell surfaces-the CD4 

15 glycoprotein and members of the chemokine receptor family. R. Wyatt & J. 
Sodroski, The HIV-1 Envelope Glycoproteins: Fusogens, Antigens, and 
Immunogens, 280 Science 1884-1888 (1998). The CD4 glycoprotein is expressed on 
the surface of T lymphocytes, monocytes, dendritic cells, and brain microglia, the 
main target cells for primate immunodeficiency viruses in vivo. Use of CD4 protein 

2 0 or a derivative thereof as a targeting molecule would cause the present invention to 
bind surface-exposed pg!20 and enter HIV-infected cells and cause inactivation of 
ribosomes in such cells, thus destroying the ability of the cells to produce new HIV 
proteins. However, this approach would only target cells in the late stage of 
infection that are expressing viral proteins on the cell surface. The problem at that 

2 5 point is that the viral genome has already been incorporated into the host cell 
genome. If viral particles are not being actively produced and their proteins 
displayed on the cell surface, then the cell will avoid the toxic effects of the CD4- 
based proricin variant. The virus could rebound at a later date upon cellular 
activation. G. Mathe, The Kinetics of Cancer Cells and of HIV1 : The Problems of 



WO 01/60393 



PCT/US01/05282 



24 

Cell and Virus Rebounds and of Latency, 52 Biomed. Pharmacother. 413-420 
(1998). 

HIV infections of central nervous system (CNS) cells can involve galactosyl 

ceramide as a receptor, enabling virus entry by a CD4-independent pathway. S.H. 

5 Pincus & V. V. Tolstikov, supra. Furthermore, infected CNS cells express little or no 

viral antigens on their surfaces. While these are not major sites of viral replication, 

such infected cells 1 may represent a barrier to complete elimination of viral 

replication. Classical immunotoxins are poor candidates for attacking such cells. In 

contrast, the present embodiments of the invention involving fully functional B chain 

10 could readily bind to galactosyl ceramide, yet remain inactive unless HIV protease is 

encountered. Further, a compatible embodiment involving fatty acid attachment 

would facilitate crossing of the blood-brain barrier, which is a major impediment to 

the efficacy of immunotoxins. 
•* ■ ' - 

Another targeting molecule would be a portion of interleukin-2 (IL-2) that 

1 5 binds to the high-affinity IL-2 receptors found on a subset of activated T- 

lymphocytes thought to be the main or sole site of HIV replication. However, it has 
not been demonstrated that this is the sole site in infected humans. S.H. Pincus, 
supra. Furthermore, the expression of IL-2 receptors on the surface of T cells 
appears to result from HIV infection or the binding of HIV gpl20 to CD4 on the T- 

2 0 cell surface, R. W. Finberg et ah, Selective elimination of HI V- 1 -infected Cells with 
an Interleukin-2 Receptor-specific Cytotoxin, 252 Science 1703-1705 (1991). 
Therefore, use of a segment of IL-2 on a classical immunotoxin would target infected 
cells, although T cells activated by non-HIV stimulants would also be eliminated. 
S.H. Pincus & V.V. Tolstikov, supra. However, infected cells targeted in this 

2 5 manner should be at an earlier stage of infection relative to cells targeted by 

immunotoxins containing CD4 segments or antibodies against viral proteins. A 
portion of IL-2 has been fused to diphtheria toxin as an anti-HIV "immunotoxin." 
R.W. Finberg et al., supra. 
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Since use of IL-2 as a targeting molecule on classical immunotoxins could 
result in destruction of uninfected cells and deleterious health effects, the protease 
activation mechanism of the present invention will be important to confer additional 
selectivity. A strategy for attacking viral latency (stable, dormant provirus integrated 
5 into host DNA) is to activate infected T cells to express IL-2 receptors and perhaps 
viral proteins by exposing patients to exogenous HIV gpl20. H. Komfeld, W. W. 
Cruikshank, S. W. J Pyle, J. S. Berman. D. M Center, Lymphocyte Activation by 
HIV-1 Envelope Glycoprotein, 335 Nature 6189 (1988). After several hours, 
treatment with a protease-activated therapeutic agent containing IL-2 would destroy 

10 infected cells. Without activation of the provirus, the therapeutic agent would likely 
be degraded in the cell before it could be activated. Alternatively, use of the 
carbohydrate-(galactose-) binding B chain or another lectin separately or as part of 
the toxin-based therapeutic agent could activate infected T cells. B.A. Sela et al., 
Lymphocyte Activation l>y Monovalent Fragments of Antibodies Reactive with Cell 

15 Surface Carbohydrates, 143 J. Exp. Med. 665-671 (1976); O. Closs et al., 

Stimulation of Human Lymphocytes by Galactose-specific Abrus and Ricinus 
Lectins, 1 1 5 J. Immunol 't 045- 1 048 ( 1 975). 

A better approach would be to kill infected cells at a very early stage of 
infection, i.e. before the integration of the viral genome. To do this would require a 

2 0 different targeting molecule. One possibility is attaching an octapeptide (or 

derivative thereof) from the HIV envelope glycoprotein gpl20. C.B. Pert et al., 
Octapeptides Deduced from the Neuropeptide Receptor-like Pattern of Antigen T4 in 
Brain Potently Inhibit Human Immunodeficiency Virus Receptor Binding and T-cell 
Infectivity, 83 Proc. NatM Acad. Sci. USA 9254-9258 (1986). This short sequence is 

2 5 believed to play an important role in HIV attachment to cells. It could be 

incorporated into the proricin variant at its N-terminus or at another site in the 
variant. The receptor for this octapeptide is the T4 or CD4 protein. Entry of primate 
immunodeficiency viruses into the host cell involves the binding of the gpl20 
envelope glycoprotein to the CD4 glycoprotein, which serves as the primary 
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receptor. P.D. Kwong et al., Structure of an HIV gpl20 Envelope Glycoprotein in 
Complex with the CD4 Receptor and a Neutralizing Human Antibody, 393 Nature 
648-659 (1998). This receptor is also located on membranes of cells in the human 
brain. Interestingly, patients with AIDS show neuropsychological deficits. This 
5 embodiment of the invention could thus enter any cells that attach to HIV particles, 
but would be activated only when HIV protease is present. 

Other portions of viral proteins could also be fused to the proricin variant. 
One of these is a largely deglycosylated "gpl20 core" consisting of a truncated 
version of the native gpl20 with mutations in some of the loops. P.D. Kwong et al., 

1 0 supra; R. Wyatt et al., The Antigenic Structure of the HIV gpl20 Envelope 

Glycoprotein, 393 Nature 705-71 1 (1998). Several antigenic residues in gpl20 
interact with the CD4 protein. R. Wyatt et al., supra. Peptides based on these data 
can be fused to the proricin variant. Also, a short segment of the V3 loop of the 
gpl 20 protein can bind to "secondary" cell receptors. J.B. Ghiara et al., Crystal 

15 Structure of the Principal Neutralization Site of HIV-i, 264 Science 82-85 (1994). 
This segment could be flanked by cysteine residues to form a disulfide bond in the 
genetically engineered variant and thereby stabilize the immunogenic secondary 
structure of the V3 loop segment. A. Zhang et al., A Disulfide-bound HIV-1 V3 
Loop Sequence on the Surface of Human Rhinovirus 14 Induces Neutralizing 

20 Responses Against HIV-1, 380 Biol. Chem. 365-374 (1999). 

Immunogenic peptides that represent epitopes on the HIV gag protein might 
also serve as targeting molecules. J.M. Claverie et al., T-immunogenic Peptides Are 
Constituted of Rare Sequence Patterns. Use in the Identification of T Epitopes in the 
Human Immunodeficiency Virus Gag Protein, 1 8 J. Eur. Immunol. 1 547- 1553 

2 5 (1988). The gag segment will direct an agent that contains it to be incorporated into 
the virus particle forming at the inner surface of the plasma membrane. C.T. Wang 
et al., Sequence Requirements for Incorporation of Human Immunodeficiency Virus 
Gag- 2 -galactosidase Fusion Proteins into Virus-like Particles, 59 J. Med. Virol. 1 80- 
188 (1999); M. Bryant & L. Ratner, Myristoylation-dependent Replication and 
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Assembly of Human Immunodeficiency Virus 1, 87 Proc. Nat 1 Acad. Sci. USA 523- 
527 (1990), An appropriate segment may include the N-terminus of the Pr55gag 
protein, which is attached to myristic acid in vivo, thereby causing the myristylated 
protein to associate with the inner surface of the plasma membrane and the nascent 
5 virus particle. 

Peptide targeting molecules can be coupled to the proricin variant by 
expression of a recombinant fusion protein, chemical synthesis and attachment, or 
any other suitable method known in the art. Such techniques for producing 
recombinant fusion proteins are well-known in the art, and are described generally in, 

10 e.g., J. Sambrook & D. Russell, Molecular Cloning: A Laboratory Manual (3d ed., 
2001). Reagents useful in applying such techniques, such as restriction 
endonucleases and the like, are widely known in the art and commercially available 
from any of several vendors. 

The activity of HIV protease is regulated such that it is minimal in the 

15 cytoplasm and most active inside the virus particle. J. Gatlin et al., supra. The 
activity in the cytoplasm may be sufficient to activate the proricin variant of this 
invention. This phenomenon of high activity in the viral particle, however, could be 
used to advantage to kill the cells upon infection, prior to integration of the viral 
DNA into the host DNA. By attaching fatty acid residues to the proricin variant, it 

2 0 would bind to the inner surface of the cell membrane and be incorporated into the 
budding virus particle just as fatty-acid-labeled viral proteins are directed to the 
membrane. M. Bryant & L. Ratner, supra. In this sense, the fatty acid is a targeting 
molecule. Hydrophobic groups other than fatty acids could provide the same 
function. Fatty acids could be incorporated by chemical treatment with fatty acyl 

25 halides or by coupling fatty acids to the proricin via carbodiimide chemistry. Upon 
secondary infection, when the virus containing the proricin variant fuses with the 
membrane of a new cell, it would release the HIV-protease-activated proricin variant 
into the cytoplasm, where the disulfide bond is reduced and the A chain is released, 
and thereby kills the cell before integration of the viral genome. It may be necessary 
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to use an HIV-protease- cleavable linker that is not the optimal sequence for cleavage 
such that activation of the variant is promoted only within virus particles. A second 
advantage of using a fatty acylated proricin variant is that the fatty acids could 
promote cellular internalization of the therapeutic agent, V. Y. Alakhov et ah, 
5 Increasing Cytostatic Effects of Ricin A Chain and Staphylococcus aureus 

Enterotoxin A through In Vitro Hydrophobization with Fatty Acid Residues, 12 
Biotechnol. Appl. l Biochem. 94-98 (1990), bypassing the complex internalization 
route for wild-type ricin involving the endoplasmic reticulum. This has been 
demonstrated with the ricin A chain. V.Y. Alakhov et al., supra. 

10 Mannose residues on the ricin B chain may be required for the lectin binding 

activity. M. Westby et al, Preparation and Characterization of Recombinant Proricin 
Containing an Alternative Protease-sensitive Linker Sequence, 3 Bioconjug. Chem. 
375-381 (1992). Therefore, a preferred embodiment of the invention involves 
making the proricin variant in a eukaryotic host, such as yeast, to provide the 

15 necessary glycosylation. The mannose residues are also targeting molecules and can 
direct native ricin to cells displaying mannose receptors, leading to ceil intoxication. 
D.L. Newton et al., supra', B.M. Simmons et al., supra. 

If one of the targeting molecules mentioned above is used, it may be possible 
to eliminate one or more of the galactose binding sites of the ricin B chain by point 

2 0 mutations, D.L. Newton et al., supra, or deletions. The mannose residues may also 
be eliminated by expression of a gene construct in a prokaryotic host or by enzymatic 
removal. However, at least one of the galactose binding sites appears to be required 
for effective cellular internalization of native ricin. D.L. Newton et al., supra. 
Leaving the galactose binding sites intact allows the possibility that nonspecific entry 

2 5 into cells would occur despite the incorporation of other targeting molecules. 

Intoxication, however, would still require that the cells expressed a protease that 
would cleave the modified proricin-based agent. The advantage of leaving one or 
both galactose binding sites intact is that the internalization of the agent into cells 
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would be enhanced. Of course, adding fatty acid residues may allow direct cellular 
internalization of the agent and avoid the need for galactose binding sites. 

Viral vectors could also be used to deliver a gene coding for this antiviral 
agent to human cells. HIV-based vectors capable of accomplishing this are known in 
5 the art. E.g., R. Zufferey et al., Self-inactivating Lentivirus Vector for Safe and 
Efficient In Vivo Gene Delivery, 72 J. Virol. 9873-9880 (1998); H. Miyoshi et al M 
Development of a l Self-inactivating Lentivirus Vector, 72 J. Virol. 8150-8157 (1998). 
Such an HIV-based vector would have the nucleic acid coding for the antiviral agent 
of the present invention inserted into it. This vector containing the nucleic acid 
10 coding for the antiviral agent would then be administered to patients according to 
methods known in the art. A percentage of the cells in the body would become 
transduced with this construct and would thereafter continuously produce the 
antiviral agent. The antiviral agent would circulate in the blood, creating a condition 
similar to that created by injection of exogenous antiviral agent into the patient. The 
15 circulating antiviral agent could enter any HIV infectable cell, but would be activated 
and kill the cell only in the presence of HIV protease. 

In still another embodiment of the invention, one of the termini of the ncm A 
chain is extended by means of making a fusion protein, then an adenine or adenine- 
like group is attached chemically. A cysteine residue may be incorporated into the 
20 extension to facilitate chemical attachment of the adenine group. The adenine-like 
group should not contain adenosine since the adenine moiety is likely to be cleaved 
from the ribose by the A chain. In this embodiment, the X (protease cleavable) 
linker is contained within the extension and the B chain is removed. The adenine 
residue would occupy the active site of the ricin A chain and inhibit activity. The 
2 5 extension linker would contain a cleavage site of an HIV protease. Upon cleavage, 
the linker/adenine would be separated and inhibition would be relieved, activating 
the ricin A chain. Adenine and pteroic acid, an adenine-like molecule, are known 
inhibitors of ricin. A. Pallanca et al., Uncompetitive Inhibition by Adenine of the 
RNA-N-glycosidase Activity of Ribosome-inactivating Proteins, 1384 Biochim. 
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Biophys. Acta 277-284 (1998); J.D. Robertas et aL, Structural Analysis of Ricin and 
Implications for Inhibitor Design, 34 Toxicon 1325-1334 (1996). It should be noted 
that there are two cysteine residues in the primary sequence of the ricin A chain that 
do not participate in a disulfide bond and are apparently not conserved among plant 
5 and bacterial ricin-like toxins. Y. Kitaoka, Involvement of the Amino Acids Outside 
the Active-site Cleft in the Catalysis of Ricin A Chain, 257 Eur. J. Biochem. 255-262 
(1998). Thus, the l se two cysteine residues may not be essential to the activity of the 
ricin A chain. These cysteine residues could be mutated to other residues, leaving 
only the cysteine in the extension linker. 8-Adeninethiol could be coupled to this 

10 cysteine residue with, for example, a bismaleimide reagent. Pteroic acid could be 
coupled to an amine in the linker by carbodiimide chemistry. Other adenine-like 
molecules and attachment chemistries are possible and are well known in the art. 

The gene for the proricin variant according to the present invention is 
preferably produced by amplifying the intron- free gene for preproricin in castor 

15 beans. The preproricin DNA sequence is known, e.g. F.I. Lamb et aL, Nucleotide 
Sequence of Cloned cDNA Coding for Preproricin, 148 Eur. J. Biochem. 265-270 
(1 985); J.W. Tregear & L.M. Roberts, The Lectin Gene Family of Ricinus 
communis: Cloning of a Functional Ricin Gene and Three Lectin Pseudogenes, 18 
Plant Mol. Biol. 515-525 (1992); K.C. Hailing et al., Genomic Cloning and 

2 0 Characterization of a Ricin Gene from Ricinus communis, 1 3 Nucleic Acids Res. 
8019-8033 (1985); L.M. Roberts et aL, Molecular Cloning of Ricin, 7 Targeted 
Diagn. Ther. 81-97 (1992); JP 1985102188-A; U.S. Patent No. 5,622,838. Methods 
for amplifying selected DNA segments, U.S. Patent No. 4,683,195; U.S. Patent No. 
4,683,202; U.S. Patent No. 4,800,159; U.S. Patent No. 4,965,188; PCR Technology: 

25 Principles and Applications for DNA Amplification (H. Erlich ed., Stockton Press, 
New York, 1989); PCR Protocols: A guide to Methods and Applications (Innis et al. 
eds, Academic Press, San Diego, Calif., 1990), and synthesizing oligonucleotides, 
S.A. Narang et aL, 68 Meth. Enzymol. 90 (1979) (phosphotriester method); E.L. 
Brown et al., 68 Meth. Enzymol. 109 (1979) (phosphodiester method); U.S. Patent 
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No. 4,356,270; U.S. Patent No. 4,458,066; U.S. Patent No. 4,416,988; U.S. Patent 
No. 4,293,652; N.D. Sinha et al., 24 Tetrahedron Lett. 5843 (1983); N.D. Sinha et 
al., 12 Nucl. Acids Res. 4539 (1984); N.D. Sinha et al., 15 Nucl. Acids Res. 397 
(1987); N.D. Sinha et al., 16 Nucl. Acids Res. 319 (1988), are well known in the art. 
5 Methods for joining such amplified DNAs and synthetic DNAs are also well known 
in the art, e.g. J. Sambrook & D. Russell, Molecular Cloning: A Laboratory Manual 
(3d ed., 2001); T.'Maniatis et al., Molecular Cloning: A Laboratory Manual (1982); 
F. Ausubel et al., Current Protocols in Molecular Biology (1987). 

Next, the DNA encoding the preproricin variant containing the HIV protease 

10 site is cloned in an appropriate expression vector for expression in a suitable host 

cell, as is well known in the art. Methods for cloning in these vectors are well known 
in the art, e.g., Sambrook, supra; Maniatis, supra; Ausubel, supra. Such suitable 
host cells are preferably eukaryotic cells, but expression in prokaryotic cells is also 
intended to be within the scope of the present invention. Expression vectors for 

15 expression in eukaryotic cells, e.g., yeast, insect, and mammalian cells, and 
prokaryotic cells , e.g., bacterial cells, are well known in the art and are 
commercially available from numerous companies or from depositories such as the 
American Type Culture Collection. Such expression vectors generally contain the 
necessary promoters, transcription termination signals, translation initiation and 

20 termination signals, and the like for expression of the cloned DNA in the selected 

host cell. The cloned DNA is then expressed in a selected host transformed with the 
expression vector containing the variant preproricin DNA. Transformation of 
bacterial and eukaryotic cells is well known in the art and can be carried out by any 
appropriate method, such as calcium-dependent methods, heat shock, 

2 5 electroporation, and other effective methods and combinations thereof. The 

expressed protein is then purified, preferably by affinity of the ricin B chain for 
galactose-labeled agarose beads if the variant contains functional galactose binding 
sites of the B chain. Such affinity-based purification methods are also well known in 
the art. 
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Next, the purified proricin variant is tested for susceptibility to HIV protease 
and various other proteases to determine if the proricin variant is indeed selectively 
activated by HIV protease. As a first matter, the HIV protease should digest the 
proricin variant peptide, resulting in ricin A and B chains linked by a disulfide bond! 
5 The resulting ricin is chemically reduced and tested for ricin A chain activity against 
ribosomes using commercial in vitro translation kits. 

The proricin variant containing the HIV protease site can be hydrophobized, 
as briefly described above. This modification has several advantages, including 
making it easier for the proricin variant to enter cells in vivo. Further, the 

10 hydrophobic agent coupled to the proricin variant has a tendency to direct the 

proricin variant to the inner surface of the cell membrane, as described above, where 
it can be incorporated into HIV particles during budding of the HIV particles through 
the cell membrane. Preferred hydrophobic agents include bile acids, sterols, and 
saturated and unsaturated fatty acids. Preferred bile acids include cholic acid, 

15 deoxycholic acid, chenodeoxycholic acid, lithocholic acid, ursocholic acid, 

ursodeoxycholic acid, isoursodeoxycholic acid, lagodeoxycholic acid, glycocholic 
acid, taurocholic acid, glycodeoxycholic acid, glycochenodeoxycholic acid, 
dehydrocholic acid, hyocholic acid, hyodeoxycholic acid, and mixtures thereof. 
Preferred sterols include cholestanol, coprostanol, cholesterol, epicholesterol, 

20 ergosterol, ergocalciferol, and mixtures thereof. Preferred saturated and unsaturated 
fatty acids comprise about 4 to 20 carbon atoms, such as butyric acid, valeric acid, 
caproic acid, caprylic acid, capric acid, lauric acid, myristic acid, palmitic acid, 
stearic acid, oleic acid, linoleic acid, linolenic acid, eleostearic acid, and mixtures 
thereof. The hydrophobic agent can be coupled to the proricin variant by chemical 

25 conjugation or, in appropriate cases, by using enzymes. For example, an alkanoic 
acid can be coupled to the proricin variant by forming an amide bond between the 
carboxylic acid group of the alkanoic acid and an amino group of the proricin 
variant. The amino group can be the N-terminal amino group of the proricin variant 
and/or one or more of the n-amino groups of lysine residues. As is well known in 
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the art, this reaction could be carried out chemically by activating the alkanoic acid 
by conversion to an acid chloride, and then reacting the acid chloride with the 
proricin variant to result in formation of the amide bond. 

In still another embodiment of the present invention, a targeting moiety is 

5 coupled to the proricin variant. For example, a peptide targeting agent can be 
coupled to the proricin molecule either chemically or by recombinant DNA 
methodology. In some embodiments, the compositions are constructed by 
chemically conjugating the targeting moiety to the proricin variant. "Chemically 
conjugating" the targeting moiety to the proricin variant, as that term is used herein, 

1 0 means covalently bonding the targeting moiety to the proricin variant either directly 
or by way of a spacer moiety. Such spacer moieties can include heterobifunctional 
crosslinkers, such as are well known in the art. Peptide portions of the compositions 
of the present invention can be produced in a genetically engineered organism, such 
as E. coli or yeast, as a "fusion protein." That is, a hybrid DNA containing a 

15 sequence of nucleotides encoding the targeting moiety and the proricin variant can be 
constructed by recombinant DNA technology. This hybrid DNA can be inserted into 
an organism such that the "fusion protein" encoded by the hybrid DNA is expressed, 
as described above. The fusion protein can then be purified by standard methods, 
including affinity chromatography. If the targeting moiety is a relatively short 

2 0 peptide, such peptide can also be chemically synthesized. Methods for synthesis of 
peptides are well known in the art. E.g., R.B. Merrifield, Solid Phase Peptide 
Synthesis,32 Adv. Enzymol. Relat. Areas Mol. Biol. 221-96 (1969); R.B. Merrifield 
et al. f 21 Biochemistry 5020-31 (1982) (solid phase peptide synthesis); Houghten, 82 
Proc. Nat'l Acad. Sci. USA 5131-35 (1985) (solid phase peptide synthesis); 

25 Hunkapilleretal., 310 Nature 105-111 (1984). 



Example 1 

In this example there is described an illustrative method for making a 
modified proricin according to the present invention. A DNA encoding modified 
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preproricin is assembled from two DNA fragments amplified by PCR using two sets 
of primers containing the selected modifications. These primers were designed using 
the Lasergene program (DNASTAR Inc., Madison, Wisconsin). The template for the 
PCR reaction is cellular DNA isolated from castor bean; this is possible because the 
5 preproricin gene contains no introns. The resulting modified protein replaces the 
first 10 amino acid residues in the native proricin linker, which contains 12 amino 
acid residues, witli the sequence Val-Ser-Gln-Asn-Tyr-Pro-Ile-Val-Gln-Asn (SEQ ID 
NO:4). This modified linker peptide (SEQ ID NO: 1 2) is a substrate for the HIV 
protease, which digests the peptide between the tyrosine and proline residues. The 
10 nucleotide sequence coding for the altered amino acid residues (SEQ ID NO:5) was 
chosen using the codons that are most frequently used in baker's yeast, 
Saccharomyces cerevisiae. This nucleotide sequence contains a restriction 
endonuclease digestion site in the middle portion of the modified sequence. This 
site, CAATTG, is recognized by the enzyme Mfel, which has no other sites in the 
15 preproricin gene sequence. Thus, the strategy for making the modified preproricin 
gene is to PCR-amplify the entire modified preproricin gene in two segments that 
overlap in the region encoding the modified linker. The PCR primers for amplifying 
these fragments are SEQ ID NO:6 and SEQ ID NO: 7 for amplifying the 5' portion 
of the preproricin gene and SEQ ID NO: 8 and SEQ ID NO:9 for amplifying the 3' 
2 0 portion of the gene. After amplification, both fragments are digested with Mfel and 
then ligated together to produce a complete, modified preproricin coding sequence 
(SEQIDNO:10). 

Since bacteria do not properly glycosylate eukaryotic proteins, and the 
available evidence suggests that proper processing and glycosylation is required for 
2 5 stability, solubility, and galactose binding activity of the modified preproricin, M. 
Westby et al., supra, the modified preproricin gene is expressed in yeast. Yeast 
glycosylate and process such a protein provided that it contains a typical eukaryotic 
leader sequence. P.T. Richardson et al., The Expression of Functional Ricin B-chain 
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in Saccharomyces cerevisiae, 950 Biochim. Biophys. Acta 385-394 (1988). Thus the 
upstream end of the amplified gene includes the entire coding sequence of 
preproricin instead of merely the coding sequence of the mature, processed protein 
(i.e., proricin). Although there are a large number of effective yeast expression 
5 vectors available in the art, in this example the modified preproricin gene is inserted 
in the yeast expression vector, pYES2, available commercially from Invitrogen 
(Carlsbad, CalifoAna). This vector contains origins of replication and selectable 
markers for replication and selection in both yeast and E. coli. In addition, this 
vector contains 10 unique restriction sites in its multiple cloning site, the large 

10 majority of which do not occur in the modified preproricin gene sequence. The Sad 
and Xhol sites were chosen for cloning and expression of the preproricin gene. Thus, 
a Sad site is engineered into the 5' end of the gene using the primer of SEQ ID 
NO:6, and an Xhol site is engineered into the 3' end of the gene using the primer of 
SEQ ID NO:9. Therefore, after PCR amplification, besides being digested with 

15 Mfel, each fragment is digested with Sad or Xhol, as appropriate; the pYES2 vector 
is digested with bothSacI and Xhol; and the digested preproricin gene fragments and 
pYES2 vector are mixed and ligated according to methods well known in the art. 
The sticky ends direct how the various fragments are joined, and experience shows 
that the vast majority of recovered circular plasmids have the desired structure. 

2 0 After ligation, the ligation mixture is used to transform competent E. coli 

cells according to methods well known in the art. Transformed cells are selected 
using the ampicillin marker on the pYES2 vector. Plasmid minipreps are prepared 
from several randomly selected colonies. Colonies containing the desired plasmid 
are identified by the size of the plasmid compared to the vector and by the presence 

2 5 of DNA fragments of the correct size following digestion with SacI and Mfel and 
with Mfel and Xhol. A large preparation of the plasmid containing the correct 
fragments is then prepared according to methods well known in the art and used to 
transform an appropriate yeasthost strain (e.g., INVScl, Invitrogen) using the 
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spheroplast procedure cr by making yeast cells competent using proprietary reagents 
(Invitrogen). Transformed cells are selected using the ura3 marker on the pYES2 
vector, and then selected cells are grown and expression of the modified proricin is 
induced by addition of galactose to the medium. After further growth for expression, 
5 the cells are harvested, washed, lysed by agitation with glass beads or by sonication, 
and a cell- free extract is prepared. The modified proricin is purified from this extract 
using its galactose-binding activity by chromatography on an affinity column 
containing either bound galactose or lactose, similar to the way in which recombinant 
fusion proteins containing the maltose binding protein are purified. Kellerman & 
10 Ferenci, 90 Methods in Enzymology 459-463 (1982). Such columns are available 
commercially, such as from Sigma Chemical Co. (St. Louis, Missouri). The bound 
protein is specifically eluted from the column with galactose or a galactose- 
containing molecule. 

¥ 4 Example 2 

15 In this example, the HIV-protease-susceptible linker SEQ ID NO: 1 3 is 

substituted for SEQ ID NfO:12 (see Example 1) using methods well known in the art. 

Example 3 

In this example, a polynucleotide encoding a fusion peptide, comprising yeast 
2 0 alpha factor leader-ricin A chain-HIV protease cleavable linker-ricin B chain 

containing a phe to gly mutation at the extreme C-terminus-stop signal, was inserted 
in the pYES2 yeast expression vector. The alpha factor leader allows secretion and 
translocation to the endoplasmic reticulum where glycosylation can occur. The phe 
to gly mutation allows for creation of a unique restriction site and exposes 
2 5 hydrophobic residues on the A chain, according to published crystal structure data. 
E. Rutenber & J.D. Robertus, Structure of Ricin B-chain at 2.5 A Resolution, 10 
Proteins 260-269(1991). 
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Example 4 

In this example, a polynucleotide encoding a fusion peptide, comprising: 
[maltose binding protein-factor Xa site-DP178-ricin A chain-HIV protease 
cleavable linker-truncated ricin B chain-repeat of hydrophobic val-ser-ile-leu-ile- 
5 pro-ile-ile-ala-leu-met-val (SEQ ID NO: 1 4) (duplicated from the C-terminus of the A 
chain)-stop signal], was inserted in the pMALp2X E. coli expression vector. DP 178 
is a section of the gp41 ectodomain involved in viral fusion to host cells. It is about 
40 amino acid residues in length and has been shown to inhibit infectivity. The 
truncated B chain includes the first cys residue and about an additional 7-8 residues. 
10 The hydrophobic repeat is intended to facilitate translocation across membranes, 
such as the lumen of the endoplasmic reticulum to the cytosol. 



Example 5 

' ' " ' In this exampfe, a gly-ala-arg-ala-ser (SEQ ID NO: 17) myristylation signal 
15 sequence (from HIV gag protein) was inserted between the factor Xa cleavage site 
and DP 1 78 of the construct of Example 4. The myristylation sequence should 
promote attachment of fatty acids in vivo, and should direct the therapeutic agent to 
an intracellular site of nascent virus particles and HIV protease activity. 



20 Example 6 

In this example, an "L domain" (pro-pro-pro-pro-tyr, SEQ ID NO: 15) and an 
ER lumen retention signal (lys-asp-glu-leu, SEQ ID NO: 1 6) are inserted after the 
repeat of the hydrophobic sequence from the C-terminus of the A chain of the 
construct of Example 5. The short L domain sequence has been fused to proteins 

2 5 other than the viral proteins in which it naturally occurs. It promotes the release of 
viral proteins from membranes by an unknown mechanism. The ER lumen retention 
signal has been shown to enhance toxicity of the ricin A chain when fused to the C- 
terminus thereof. 
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Example 7 

In this example, the procedure of Example 3 was carried out except that a 
single domain ricin B chain was substituted for the ricin B chain. The single domain 
ricin B chain contains no glycosylation sites, but has both galactose binding sites. 

5 

Example 8 

Three constructs were prepared for evaluating the concept that highly 
truncated ricin B chain can inhibit ricin A chain activity and thus permit an HIV- 
protease-dependent activation mechanism. These constructs were designed for 

10 cytosolic expression in yeast, and were expected to kill the cells if the A chain was in 
active form. Construct A comprises a fusion of ricin A chain-HIV protease 
cleavable linker-full ricin B chain, in pYES2. Construct B comprises a fusion of 
ricin A chain-HIV protease cleavable linker truncated at the expected cleavage site, 
in pYES2. Construct C comprises a fusion of ricin A chain-HIV protease cleavable 

1 5 linker-highly truncated B chain-repeat of hydrophobic sequence (SEQ ID NO: 1 4) 
duplicated from C-terminus of A chain, in pYES2. 

Yeast containing plasmids A, B, or C were streaked onto semi-solid media 
lacking uracil and containing glucose, raffinose, or raffinose + galactose. The 
pYES2-based plasmids were maintained in the cells because they confer the ability 

2 0 to grow in the absence of uracil. After incubation for 5 days at 30°C, the amount of 
growth was noted. Glucose permits growth but represses synthesis of the encoded 
agent. Raffinose permits growth but neither represses nor induces synthesis. 
Galactose alone does not permit growth, but induces synthesis of the encoded agent. 
Growth was obtained in all cases except for cell containing plasmid B on raffinose + 

2 5 galactose, wherein there was no growth. These results suggest that cleavage at the 

expected site by HIV protease results in activation of the ricin A chain and cell death. 
Further, the B chain may be highly truncated, albeit with a hydrophobic extension, 
and still inhibit A chain activity. 
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WE CLAIM ; 

1 . A composition comprising a compound represented by the formula 
(T m -A-X-B)-H n or (A-X-B-T m )-H n , wherein A is a protein synthesis inactivating 
toxin that is inactive until X is digested; X is a peptide susceptible to digestion by a 

5 viral protease; B is a lectin or a segment thereof, T is a targeting moiety, H is a 
hydrophobic agent, m is 0 or an integer of at least 1, and n is 0 or an integer of at 
least 1. 

2. The composition of claim 1 wherein said viral protease is from a 
10 retrovirus, picornavirus, rhinovirus, hepatitis C virus, or herpesvirus. 



3. The composition of claim 1 wherein A is a ricin A chain and B is a 
ricin B chain. 

15 4. The composition of claim 1 wherein X is a member selected from the 

group consisting of SEQ'lD NO:12 and SEQ ID NO: 13. 

5. The composition of claim 1 wherein said targeting moiety is a 
member selected from the group consisting of antigen-binding proteins, viral surface 

2 0 components and segments thereof, proteins that bind viral surface components, 
growth factors, lectins, and carbohydrates. 

6. The composition of claim 5 wherein said targeting moiety is a 
member selected from the group consisting of antibodies against gpl20, antibodies 

2 5 against gp41 , and the CD4 protein or segments thereof. 



7. The composition of claim 5 wherein said targeting moiety is an 
antigen-binding protein that binds the CD4 glycoprotein. 



WO 01/60393 



PCT/US01/05282 



40 

8. The comDosition of claim 7 wherein the protein that binds the CD4 
glycoprotein is gp 120 or a segment thereof. 

9. The composition of claim 1 wherein said targeting moiety is a GAG 
5 protein segment. 

10. The composition of claim 1 wherein said hydrophobic agent is a 
member selected from the group consisting of bile acids, sterols, and saturated and 
unsaturated fatty acids. 

10 

1 1 . The composition of claim 1 0 wherein said hydrophobic agent is a bile 
acid selected from the group consisting of cholic acid, deoxycholic acid, 
chenodeoxycholic acid, lithocholic acid, ursocholic acid, ursodeoxycholic acid, 
isoursodeoxycholic acid, lagodeoxycholic acid, glycocholic acid, taurocholic acid, 

15 glycodeoxycholic acid, glycochenodeoxycholic acid, dehydrocholic acid, hyocholic 
acid, hyodeoxycholic acid, and mixtures thereof 

1 2. The composition of claim 1 0 wherein said hydrophobic agent is a 
sterol selected from the group consisting of cholestanol, coprostanol, cholesterol, 

2 0 epicholesterol, ergosterol, ergocalciferol, and mixtures thereof. 



13. The composition of claim 10 wherein said hydrophobic agent is a 
saturated or unsaturated fatty acid comprising about 4 to 20 carbon atoms. 



25 14. The composition of claim 1 3 wherein said saturated or unsaturated 

fatty acid is a member selected from the group consisting of butyric acid, valeric 
acid, caproic acid, caprylic acid, capric acid, lauric acid, myristic acid, palmitic acid, 
stearic acid, oleic acid, linoleic acid, linolenic acid, eleostearic acid, and mixtures 
thereof. 
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15. The composition of claim 1 further comprising a pharmaceutical^ 
acceptable carrier admixed with the compound. 

5 16. The composition of claim 1 wherein A, B, or both are from ADP- 

ribosyltransferases. 

17. The composition of claim 1 wherein T comprises a myristylation 
signal sequence. 

10 

18. A composition comprising a compound represented by the formula N 
X-A or A-X-N, wherein A is a protein synthesis inactivating toxin that is inactive 
until X is digested, X is a peptide susceptible to digestion by a viral protease, and N 
is an adenine moiety or functional equivalent thereof. 

15 

19. The composition of claim 1 8 wherein A is a ricin A chain. 

20. The composition of claim 1 8 wherein X is a member selected from 
the group consisting of SEQ ID NO: 12 and SEQ ID NO: 13. 

20 

21. The composition of claim 18 wherein N is adenine. 

22. The composition of claim 18 wherein N is a member selected from 
the group consisting of pteroic acid and 8-adeninethiol. 

25 

23. A method for treating a human immunodeficiency virus infection 
comprising administering an effective amount of a composition comprising: 

(a) a member selected from the group consisting of: 
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(i) a compound represented by the formula (T m -A-X-B)-H n or 
(A-X-B-T m )-H n , wherein A is a protein synthesis inactivating toxin that is 
inactive until X is digested; X is a peptide susceptible to digestion by a 
human immunodeficiency virus protease; B is a lectin or a segment thereof, T 

5 is a targeting moiety, H is a hydrophobic agent, m is 0 or an integer of at least 

1 , and n is 0 or an integer of at least 1 , 

(ii) * a compound represented by the formula N-X-A or A-X-N, 
wherein A is a protein synthesis inactivating protein that is inactive until X is 
digested, X is a peptide susceptible to digestion by a human 

10 immunodeficiency virus protease, and N is an adenine moiety or functional 

equivalent thereof, and 

(iii) mixtures of (i) and (ii); and 
(b) a phsurgaceutically acceptable carrier. 

15 24. The method of claim 23 wherein A is a ricin A chain and B is a ricin 

B chain. 



20 



25 



25. The method of claim 23 wherein X is a member selected from the 
group consisting of SEQ ID NO:12 and SEQ ID NO: 13. 

26. The method of claim 23 wherein said targeting moiety is a member 
selected from the group consisting of antigen-binding proteins, viral surface 
components and segments thereof, proteins that bind viral surface components, 
growth factors, lectins, and carbohydrates. 

27. The method of claim 26 wherein said targeting moiety is an antigen- 
binding protein that binds the CD4 glycoprotein. 
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28. The method of claim 27 wherein the protein that binds the CD4 
glycoprotein is gp 120 or a segment thereof. 



29. The method of claim 23 wherein said targeting moiety is a GAG 
5 protein segment. 

30. Thfe method of claim 23 wherein said hydrophobic agent is a member 
selected from the group consisting of bile acids, sterols, and saturated and 
unsaturated fatty acids. 

10 

3 1 . The method of claim 30 wherein said hydrophobic agent is a bile acid 
selected from the group consisting of cholic acid, deoxycholic acid, 
chenodeoxycholic acid, lithocholic acid, ursocholic acid, ursodeoxycholic acid, 
isoursodeoxycholic acid, lagodeoxycholic acid, glycocholic acid, taurocholic acid, 

15 glycodeoxycholic acid, glycochenodeoxycholic acid, dehydrocholic acid, hyocholic 
acid, hyodeoxycholic acid, and mixtures thereof. 

32. The method of claim 30 wherein said hydrophobic agent is a sterol 
selected from the group consisting of cholestanol, coprostanol, cholesterol, 

2 0 epicholesterol, ergosterol, ergocalciferol, and mixtures thereof. 

33. The method of claim 30 wherein said hydrophobic agent is a saturated 
or unsaturated fatty acid comprising about 4 to 20 carbon atoms. 

2 5 34. The method of claim 33 wherein said saturated or unsaturated fatty 

acid is a member selected from the group consisting of butyric acid, valeric acid, 
caproic acid, caprylic acid, capric acid, lauric acid, myristic acid, palmitic acid, 
stearic acid, oleic acid, linoleic acid, linolenic acid, eleostearic acid, and mixtures 
thereof. 
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35. A nucleic acid encoding a peptide represented by the formula A-X-B 
wherein A is a protein synthesis inactivating toxin that is inactive until X is digested; 
X is a peptide susceptible to digestion by a human immunodeficiency virus protease; 
and B is a lectin or a segment thereof or other peptide targeting moiety. 

5 

36. The nucleic acid of claim 35 wherein A is a ricin A chain and B is a 
ricin B chain. 1 

37. The nucleic acid of claim 35 wherein X is a member selected from the 
10 group consisting of SEQ ID NO:12 and SEQ ID NO:13. 

38. A composition comprising a compound represented by the formula 
A-X-B, wherein A is a protein synthesis inactivating toxin that is inactive until X is 
digested; X is a peptide susceptible to digestion by a viral protease; and B is a lectin 
or a segment thereof. 

15 

i ... - 

39. The composition of claim 38 wherein said viral protease is from a 

retrovirus, picomavirus, rhinovirus, hepatitis C virus, or herpesvirus. 

40. The composition of claim 38 wherein X is a member selected from 
2 0 the group consisting of SEQ ID NO: 1 2 and SEQ ID NO: 1 3 . 

41 . The composition of claim 38 wherein A is a ricin A chain and B is a 
ricin B chain. 

2 5 42. The composition of claim 38 wherein A is a ricin A chain and B is a 

fragment of a ricin B chain. 

43. The composition of claim 38 further comprising a pharmaceutically 
acceptable carrier admixed with the compound. 
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44. A composition comprising a compound represented by the formula A- 
X-B, wherein A is a ricin A chain that is inactive until X is digested; X is a peptide 
susceptible to digestion by a viral protease and is a member selected from the group 

5 consisting of SEQ ID NO: 12 and SEQ ID NO:l 3; and B is a ricin B chain or a 
segment thereof. 

45. The composition of claim 44 further comprising a pharmaceutically 
acceptable carrier admixed with the compound. 



I ... .-■ . 
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SEQUENCE LISTING 

<110> Keener, William K. 

Ward, Thomas E. 

<120> SELECTIVE DESTRUCTION OF CELLS INFECTED WITH HUMAN 

IMMUNODEFICIENCY VIRUS 

<130> LIT-PI-529 

<160> 17 

<210> 1 

<211> 1698 

<212> DNA 

<213> Ricinus communis 

<308> Genbank E00351 

<400> .1 * . 

atg "tat-gSa gtg gca?aferife^T ctt *tgt itt£ tec acc tcaS ggg 45 
Met Tyr Ala Val Ala Thr Trp Leu Cys Phe Gly Ser Thr Ser Gly 

-20 -15 -10 

tgg tct ttc aca tta gag gat aac aac ata ttc ccc aaa caa tac 90 
Trp Ser Phe Thr Leu Glu Asp Asn Asn He Phe Pro Lys Gin Tyr 

-5 15 

cca att ata aac ttt acc aca gcg ggt gec act gtg caa age tac 135 
Pro He He Asn Phe Thr Thr Ala Gly Ala Thr Val Gin Ser Tyr 
10 15 20 

aca aac ttt ate aga get gtt cgc ggt cgt tta aca act gga get 180 
Thr Asn Phe He Arg Ala Val Arg Gly Arg Leu Thr Thr Gly Ala 
25 30 35 

gat gtg aga cat gat ata cca gtg ttg cca aac aga gtt ggt ttg 225 
Asp Val Arg His Asp He Pro Val Leu Pro Asn Arg Val Gly Leu 
40 45 50 

cct ata aac caa egg ttt at'ftta gtt gaa etc tea aat cat gca 270 
Pro He Asn Gin Arg Phe He Leu Val Glu Leu Ser Asn His Ala 
55 60 65 
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gag ctt tct gtt 
Glu Leu Ser Val 
70 

gtc ggc tac cgt 
Val Gly Tyr Arg 
85 

aat cag gaa gat 
Asn Gin Glu Asp 
100 

caa aat cga tat 
Gin Asn Arg Tyr 
115 

gaa caa ctt get 
Glu Gin Leu Ala 
130 

■- .• ^ ■■ a* - ■ 

ggt cca eta gag 
Gly Pro Leu Glu 
145 

ggt ggc 'act cag 
Gly Gly Thr Gin 
160 

ate caa atg att 
He Gin Met He 
175 

gaa atg cgc acg 
Glu Met Arg Thr 
190 

cct age gta att 
Pro Ser Val He 
205 

gca att caa gag 
Ala He Gin Glu 
220 

ctg caa aga cgt 
Leu Gin Arg Arg 
235 



aca tta gee ctg 
Thr Leu I\la Leu 



get gga aat age 
Ala Gly Asn Ser 



gca gaa gca ate 
Ala Glu Ala He 



aca ttc gee ttt 
Thr Phe Ala Phe 



ggt aat ctg aga 
Gly Asn Leu Arg 

gag get ate tea 
Glu Ala He Ser 

ctt "tea act "ctg 
Leu Pro Thr Leu 



tea gaa gca gca 
Ser Glu Ala Ala 



aga att agg tac 
Arg lie Arg Tyr 



aca ctt gag aat 
Thr Leu Glu Asn 



tct aac caa gga 
Ser Asn Gin Gly 



aat ggt tbc~ aaa 
Asn Gly Ser Lys 



gat gtc ace aat 
Asp Val Thr Asn 
75 

gca tat ttc ttt 
Ala Tyr Phe Phe 
90 

act cat ctt ttc 
Thr His Leu Phe 
105 

ggt ggt aat tat 
Gly Gly Asn Tyr 
120 

gaa aat ate gag 
Glu Asn lie Glu 
135 . 

gcg ctt tat tat 
Ala Leu Tyr Tyr 
150 

get* cgt"tcc ttt 
Ala Arg Ser Phe 
165 

aga ttc caa tat 
Arg Phe Gin Tyr 
180 

aac egg aga tct 
Asn Arg Arg Ser 
195 

agt tgg ggg aga 
Ser Trp Gly Arg 
210 

gee ttt get agt 
Ala Phe Ala Ser 
225 

ttc agt gtg tac 
Phe Ser Val Tyr 
240 



gca tat gtg 315 
Ala Tyr Val 
80 

cat cct gac 360 
His Pro Asp 
95 

act gat gtt 405 
Thr Asp Val 
110 

gat aga ctt 450 
Asp Arg Leu 
125 

ttg gga aat 495 
Leu Gly Asn 
140 

tac agt act 540 
Tyr Ser Thr 
155 

at a att fcgc 58 5 
He He Cys 
170 

att gag gga 630 
lie Glu Gly 
185 

gca cca gat 67 5 
Ala Pro Asp 
200 

ctt tec act 720 
Leu Ser Thr 
215 

cca att caa 765 
Pro He Gin 
230 

gat gtg agt 810 
Asp Val Ser 
245 
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ata 
He 


tta 
Leu 


ate 
He 


cct 
Pro 
250 


ate 
lie 


ata 
He 


get 
Ala 


etc 
Leu 


atg 
Met 
255 


gtg 
Val 


tat 
Tyr 


aga 
Arg 


tgc 
Cys 


gca 
Ala 
260 


cct 
Pro 


855 


5 


cca 
Pro 


cca 
Pro 


teg 
Ser 


tea 
Ser 
265 


cag 
Gin 


ttt 
Phe 


tct 
Ser 


ttg 
Leu 


ctt 
Leu 
270 


ata 
He 


agg 
Arg 


cca 
Pro 


gtg 

Val 


gta 
Val 
275 


cca 
Pro 


900 




aat 
Asn 


ttt 
Phe 


aat 
Asn 


get 
Ala 
280 


gat 
Asp 


gtt 
Val 


tgt 
Cys 


atg 
Met 


gat 
Asp 
285 


cct 
Pro 


gag 
Glu 


ccc 
Pro 


ata 
lie 


gtg 
Val 
290 


cgt 
Arg 


945 


10 


ate 
He 


gta 
Val 


ggt cga 
Gly Arg 
295 


aat 
Asn 


ggt 
Gly 


eta 
Leu 


tgt 
Cys 


gtt 
Val 
300 


gat 
Asp 


gtt 
Val 


agg 
Arg 


gat 
Asp 


gga aga 
Gly Arg 
305 


990 


. 15 


ttc 
Phe 

■-<» j* .mi: 


cac aac gga 
His Asn Gly 
310 


aac gca 
Asn Ala 

• W > ■ 


ata 
He 


cag ttg 
Gin Leu 
315 


tgg 
Trp 

y 


cca 
Pro 

•..«» .t 


tgc 
Cys 


aag 
Lys 

Mr m- 


tct 
Ser 
320 


aat 
Asn 

» r- 


1035 




aca 
Thr 


gat gca 
Asp Ala 


aat 
Asn 
325 


cag 

Gin 

> - 


etc 
Leu 


tgg 
Trp 


act 
Thr 


ttg 
Leu 
330 


aaa 
Lys 


aga 
Arg 


gac 
Asp 


aat 
Asn 


act 
Thr 
335 


att 
He 


1080 


20 


cga 
Arg 


tct 
Ser 


aat* gga 
Asn Gly 
340 


aag 
Lys 


tgt 

Cys 


tta 
Leu 


act" 
Thr 


'act' 
Thr 
345 


*• * 
tac 

Tyr 


ggg*" 

Gly 


tac 
Tyr 


kgt 
Ser 


ccg 
Pro 
350 


gga 
Gly 


1125 1 ; 




gtc 
Val 


tat 
Tyr 


gtg 
Val 


atg 
Met 
355 


ate 
He 


tat 
Tyr 


gat 
Asp 


tgc 
Cys 


aat 
Asn 
360 


act 
Thr 


get 
Ala 


gca 
Ala 


act 
Thr 


gat gec 
Asp Ala 
365 


1170 


25 


acc 
Thr 


cgc 
Arg 


tgg 
Trp 


caa 
Gin 
370 


ata 
He 


tgg 
Trp 


gat 
Asp 


aat 
Asn 


gga 
Gly 
375 


acc. 
Thr 


ate 
He 


ata 
lie 


aat 
Asn 


ccc 
Pro 
380 


aga 
Arg 


1215 


30 


tct 
Ser 


agt 
Ser 


eta 
Leu 


gtt 
Val 
385 


tta 
Leu 


gca 
Ala 


gcg 
Ala 


aca 
Thr 


tea 
Ser 
390 


ggg 

Gly 


aac 
Asn 


agt 
Ser 


ggt 
Gly 


acc 
Thr 
395 


aca 
Thr 


1260 




ctt 
Leu 


a eg 
Thr 


gtg 
Val 


caa 
Gin 
400 


acc 
Thr 


aac 
Asn 


att 
He 


tat 
Tyr 


gec 
Ala 
405 


gtt 
Val 


agt 
Ser 


caa 
Gin 


ggt 
Gly 


tgg 
Trp 
410 


ctt 
Leu 


1305 


35 


cct 
Pro 


act 
Thr 


aat 
Asn 


aat 
Asn 
415 


aca 
Thr 


caa 
Gin 


cct 
Pro 


ttt 
Phe 


gtt 
Val 
420 


aca 
Thr 


acc 
Thr 


att 
He 


gtt 
Val 


ggg 

Gly 
425 


eta 
Leu 


1350 



3 



WO 01/60393 PCT/US01/05282 





tat 
Tyr 


ggt 
Gly 


ctg 
Leu 


tgc 
Cys 
430 


ttg 
Leu 


caa 
Gin 


gca 
Ala 


aat 
Asn 


agt 
Ser 
435 


gga 
Gly 


caa 
Gin 


gta 
Val 


tgg 

Trp 


ata 
He 
440 


gag 
Glu 


1395 


5 


gac 
Asp 


tgt 
Cys 


age 
Ser 


agt 
Ser 
445 


gaa 
Glu 


aag 
Lys 


get 
Ala 


gaa 
Glu 


caa 
Gin 
450 


cag 
Gin 


tgg 
Trp 


get 
Ala 


ctt 
Leu 


tat 
Tyr 
455 


gca 
Ala 


1440 




gat 
Asp 


ggt 
Gly 


tea 
Ser 


ata 
He 
460 


cgt 
Arg 


cct 
Pro 


cag 
Gin 


caa 
Gin 


aac 
Asn 
465 


cga 
Arg 


gat 
Asp 


aat 
Asn 


tgc 
Cys 


ctt 
Leu 
470 


aca 
Thr 


1485 


10 


agt 
Ser 


gat 
Asp 


tct 
Ser 


aat 
Asn 
475 


ata 
He 


egg 
Arg 


gaa 
Glu 


aca 
Thr 


gtt 
Val 
480 


gtt 
Val 


aag 
Lys 


ate 
He 


etc 
Leu 


tct 
Ser 
485 


tgt 

Cys 


1530 




acre 

yy u 
Gly 

*:* * ■ 


cct gca 
Pro Ala 


tec 
Ser 


tct 
Ser 


aac 
Gly 


Gin 


caa 

Vp» y CL 

Arg 


taa 
»-yy 

Trp 

495 

f 


atg ttc 
Met Phe 

rtt i*t • 


aag 
Lys 


aat 
Asn 

. r 


aat 
Asp 
500 


aaa 
Gly 

' i * * 


1575 
* 




acc 
Thr 


att 
He 


tta 
Leu 


aat 
Asn 
505 


ttg 
Leu 


tat 

Tyr 

■ - 1 


agt 
Ser 


gga 
Gly 


ttg 
Leu 
510 


gtg 
Val 


tta 
Leu 


gat 
Asp 


gtg 
Val 


agg 
Arg 
515 


cga 
Arg 


1620 


20 


teg 
Ser 


« ■ 
gat 
Asp 


ccg 
Pro 


agd 
Ser 
520 


1 ct V 
.Leu 


aaa 
Lys 


caa 
Gin 


ate 
He 


att 
He 
525 


ctt' 
Leu 


tac 
Tyr 


cct 
Pro 


etc 
Leu 


cat 
His 
530 


ggt 
Gly 


1665 




gac 
Asp 


cca 
Pro 


aac 
Asn 


caa 
Gin 


ata 
He 


tgg 
Trp 


tta 
Leu 


cca 
Pro 


tta 
Leu 


ttt 
Phe 


tga 


1698 









535 540 



25 <210> 2 

<211> 1731 
<212> DNA 

<213> Ricinus communis 

<308> Genbank X52908 

30 <400> 2 

atg aaa ccg gga gga aat act att gta ata tgg atg tat gca gtg 4 5 

Met Lys Pro Gly Gly Asn Thr He Val He Trp Met Tyr Ala Val 
-35 -30 -25 
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gca aca tgg ctt tgt ttt gga tec acc tea ggg tgg tct ttc aca 90 
Ala Thr Trp Leu Cys Phe Gly Ser Thr Ser Gly Trp Ser Phe Thr 
-20 -15 -10 

tta gag gat aac aac ata ttc ccc aaa caa tac cca att ata aac 135 
5 Leu Glu Asp Asn Asn lie Phe Pro Lys Gin Tyr Pro He He Asn 
-5.-11 5 10 

ttt acc aca gcg ggt gec act gtg caa age tac aca aac ttt ate 180 
Phe Thr Thr Ala Gly Ala Thr Val Gin Ser Tyr Thr Asn Phe He 

15 20 25 

10 aga get gtt cgc ggt cgt tta aca act gga get gat gtg aga cat 225 
Arg Ala Val Arg Gly Arg Leu Thr Thr Gly Ala Asp Val Arg His 

30 35 40 

gaa ata cca gtg ttg cca aac aga gtt ggt ttg cct ata aac caa 270 
Glu He Pro Val Leu Pro Asn Arg Val Gly Leu Pro He Asn Gin 

egg ttt att tta gtt gaa etc tea aat cat gca gag ctt tct gtt 315 
Arg Phe He Leu Val Glu Leu Ser Asn His Ala Glu Leu Ser Val 

60' 65 70 



aca tta gcg ctg gat gtc acc aat gca tat gtg gtc ggc tac cgt 360 
20 Thr Leu Ala Leu Asp Val Thr Asn Ala Tyr Val Val Gly Tyr Arg 

75 80 85 

get gga aat age gca tat ttc ttt cat cct gac aat cag gaa gat 4 05 
Ala Gly Asn Ser Ala Tyr Phe Phe His Pro Asp Asn Gin Glu Asp 

90 95 100 

25 gca gaa gca ate act cat ctt ttc act gat gtt caa aat cga tat 4 50 
Ala Glu Ala He Thr His Leu Phe Thr Asp Val Gin Asn Arg Tyr 

105 HO H5 

aca ttc gec ttt gga ggt aat tat gat aga ctt gaa caa ctt get 495 
Thr Phe Ala Phe Gly Gly Asn Tyr Asp Arg Leu Glu Gin Leu Ala 
30 120 125 130 

ggt aat ctg aga gaa aat ate gag ttg gga aat ggt cca eta gag 54 0 
Gly Asn Leu Arg Glu Asn He Glu Leu Gly Asn Gly Pro Leu Glu 

135 140 145 

gag get ate tea gcg ctt tat" tat tac agt act ggt ggc act cag 585 
35 Glu Ala He Ser Ala Leu Tyr Tyr Tyr Ser Thr Gly Gly Thr Gin 

150 155 160 
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ctt cca act ctg get cgt tec ttt ata att tgc ate caa atg att 630 
Leu Pro Thr Leu Ala Arg Ssr Phe He He Cys He Gin Met He 

165 170 175 

tea gaa gca gca aga ttc caa tat att gag gga gaa atg cgc acg 675 
5 Ser Glu Ala Ala Arg Phe Gin Tyr He Glu Gly Glu Met Arg Thr 

180 185 190 

aga att agg tac aac egg aga tct gca cca gat cct age gta att 720 
Arg He Arg Tyr Asn Arg Arg Ser Ala Pro Asp Pro Ser Val He 

195 200 205 

10 aca ctt gag aat agt tgg ggg aga ctt tec act gca att caa gag 7 65 
Thr Leu Glu Asn Ser Trp Gly Arg Leu Ser Thr Ala He Gin Glu 

210 215 220 

tct aac caa gga gec ttt get agt cca att caa ctg caa aga cgt 810 
Ser Asn Gin Gly Ala Phe Ala Ser Pro lie Gin Leu Gin Arg Arg 
15 225 230 235 

aat ggt tec aaa ttc agt gtg tac gat gtg agt ata tta ate cct 855 
Asn Gly Ser Lys Phe Ser Val Tyr Asp Val Ser lie Leu lie Pro 

240 .245 250 

• ate ata get- etc atg : gtg ; tat aga tgc gca cct cca cca teg tea 900 
20 lie lie Ala Leu Met Val Tyr Arg Cys Ala Pro Pro Pro Ser Ser 

255 260 265 

cag ttt tct ttg ctt ata agg cca gtg gta cca aat ttt aat get 94 5 
Gin Phe Ser Leu Leu He Arg Pro Val Val Pro Asn Phe Asn Ala 

270 275 280 

25 gat gtt tgt atg gat cct gag ccc ata gtg cgt ate gta ggt cga 990 
Asp Val Cys Met Asp Pro Glu Pro He Val Arg lie Val Gly Arg 

285 290 295 

aat ggt eta tgt gtt gat gtt agg gat gga aga ttc cac aac gga 1035 
Asn Gly Leu Cys Val Asp Val Arg Asp Gly Arg Phe His Asn Gly 
30 300 305 310 

aac gca ata cag ttg tgg cca tgc aag tct aat aca gat gca aat 1080 
Asn Ala lie Gin Leu Trp Pro Cys Lys Ser Asn Thr Asp Ala Asn 

315 320 325 

cag etc tgg act ttg aaa aga -gac aat act att cga tct aat gga 1125 
35 Gin Leu Trp Thr Leu Lys Arg Asp Asn Thr lie Arg Ser Asn Gly 

330 335 340 
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aag tgt tta act act tac ggg tac agt ccg gga gtc tat gtg atg 1170 
Lys Cys Leu Thr Thr Tyr Gly Tyr Ser Pro Gly Val Tyr Val Met 

345 350 355 

ate tat gat tgc aat act get gca act gat gec acc cgc tgg caa 1215 
5 He Tyr Asp Cys Asn Thr Ala Ala Thr Asp Ala Thr Arg Trp Gin 

360 365 370 

ata tgg gat aat gga acc ate ata aat ccc aga tct agt eta gtt 1260 
He Trp Asp Asn Gly Thr He He Asn Pro Arg Ser Ser Leu Val 

375 380 385 

10 tta gca gcg aca tea ggg aac agt ggt acc aca ctt aca gtg caa 1305 
Leu Ala Ala Thr Ser Gly Asn Ser Gly Thr Thr Leu Thr Val Gin 

390 395 400 

acc aac att tat gec gtt agt caa ggt tgg ctt cct act aat aat 1350 
Thr Asn He Tyr Ala Val Ser Gin Gly Trp Leu Pro Thr Asn Asn 
15 405 410 415 

aca caa cct ttt gtg aca acc att gtt ggg eta tat ggt ctg tgc 1395 
Thr Gin Pro Phe Val Thr Thr He Val Gly Leu Tyr Gly Leu Cys 

420. . 425* . 430 

ttg caa gca aat agt ' gga caa gta tgg ata gag gac tgt age agt 1440 
20 Leu Gin Ala Asn Ser Gly Gin Val Trp He Glu Asp Cys Ser Ser 

435 440 445 

gaa aag get gaa caa cag tgg get ctt tat gca gat ggt tea ata 1485 
Glu Lys Ala Glu Gin Gin Trp Ala Leu Tyr Ala Asp Gly Ser He 

450 455 460 

25 cgt cct cag caa aac cga gat aat tgc ctt aca agt gat tct aat 1530 
Arg Pro Gin Gin Asn Arg Asp Asn Cys Leu Thr Ser Asp Ser Asn 

465 470 475 

ata egg gaa aca gtt gtc aag ate etc tct tgt ggc cct gca tec 1575 
He Arg Glu Thr Val Val Lys He Leu Ser Cys Gly Pro Ala Ser 
30 480 485 490 

tct ggc caa cga tgg atg ttc aag aat gat gga acc att tta aat 1620 
Ser Gly Gin Arg Trp Met Phe Lys Asn Asp Gly Thr He Leu Asn 

495 500 505 

ttg tat agt ggg ttg gtg tta- gat gtg agg gca teg gat ccg age 1665 
35 Leu Tyr Ser Gly Leu Val Leu Asp Val Arg Ala Ser Asp Pro Ser 

510 515 520 
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ctt aaa caa ate att ctt tac cct etc cat ggt gac cca aac caa 1710 
Leu Lys Gin He He Leu Tyr Pro Leu His Gly Asp Pro Asn Gin 

525 530 535 



ata tgg tta cca 
He Trp Leu Pro 



tta ttt tga 1731 

Leu Phe 

540 



<210> 
<211> 
<212> 
<213> 
<308> 
<400> 



3 

1731 
DNA 

Ricinus communis 
Genbank X03179 



atg aaa ccg gga gga aat act att gta ata tgg atg tat gca gtg 4 5 

Met Lys Pro Gly Gly Asn Thr He Val He Trp Met Tyr Ala Val 
-35 -30 -25 

gca aca tgg ctt.tgt ttt gga tec acc tea ggg tgg tct ttc aca 90 

Ala Thr Trp Leu Cys Phe Gly Ser Thr Ser Gly Trp Ser Phe Thr 
-20 -15 -10 

tta gag gat aac aac ata ttc ccc aaa caa tac cca att ata aac 135 

Leu Glu Asp Asn Asn He Phe Pro Lys Gin Tyr Pro He He Asn 

-5 -11 5 10 

ttt acc aca gcg ggt gee act gtg caa age tac aca aac ttt ate 180 

Phe Thr Thr Ala Gly Ala Thr Val Gin Ser Tyr Thr Asn Phe He 

15 20 25 

aga get gtt cgc ggt cgt tta aca act gga get gat gtg aga cat 225 

Arg Ala Val Arg Gly Arg Leu Thr Thr Gly Ala Asp Val Arg His 

30 35 40 

gaa ata cca gtg ttg cca aac aga gtt ggt ttg cct ata aac caa 270 

Glu He Pro Val Leu Pro Asn Arg Val Gly Leu Pro He Asn Gin 

45 50 55 

egg ttt att tta gtt gaa etc tea aat cat gca gag ctt tct gtt 315 
Arg Phe He Leu Val Glu Leu Ser Asn His Ala Glu Leu Ser Val 

60 65 70 
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aca tta gcg ctg 
Thr Leu Ala Leu 



get gga aat age 
Ala Gly Asn Ser 



gca gaa gca ate 
Ala Glu Ala lie 



aca ttc gec ttt 
Thr Phe Ala Phe 



ggt aat ctg aga 
Gly Asn Leu Arg 



gag get ate tea 
Glu Ala lie Ser 

ctt cca act v ct§ 
Leu Pro Thr Leu. 



tea gaa gca gca 
Ser Glu Ala Ala 



aga att agg tac 
Arg lie Arg Tyr 



aca ctt gag aat 
Thr Leu Glu Asn 



tct aac caa gga 
Ser Asn Gin Gly 



aat ggt tec aaa 
Asn Gly Ser Lys 



gat gtc acc aat 

Asp Val Thr Asn 
75 

gca tat ttc ttt 

Ala Tyr Phe Phe 
90 

act cat ctt ttc 

Thr His Leu Phe 
105 

ggt ggt aat tat 

Gly Gly Asn Tyr 
120 

gaa aat ate gag 

Glu Asn lie Glu 

gcg ctt tat tat 
Ala Leu Tyr Tyr 
150, 

get cgt tec ttt 
Ala Arg Ser Phe 
165 

aga ttc caa tat 
Arg Phe Gin Tyr 
180 

aac egg aga tct 
Asn Arg Arg Ser 
195 

agt tgg ggg aga 
Ser Trp Gly Arg 
210 

gee ttt get agt 
Ala Phe Ala Ser 
225 

ttc agt gtg'tac 
Phe Ser Val Tyr 
240 



gca tat gtg gtc 
Ala Tyr Val Val 
80 

cat cct gac aat 
His Pro Asp Asn 
95 

act gat gtt caa 
Thr Asp Val Gin 
110 

gat aga ctt gaa 
Asp Arg Leu Glu 
125 

ttg gga aat ggt 
Leu Gly Asn Gly 
140 

tac agt act ggt 
Tyr Ser Thr Gly 
155 

ata att tgc ate 
He He Cys He 
170 

att gag gga gaa 
He Glu Gly Glu 
185 

gca cca gat cct 
Ala Pro Asp Pro 
200 

ctt tea act gca 
Leu Ser Thr Ala 
215 

cca att caa ctg 
Pro He Gin Leu 
230 

gat gtg agt ata 
Asp Val Ser He 
245 



ggc tac cgt 360 
Gly Tyr Arg 
85 

cag gaa gat 405 
Gin Glu Asp 
100 

aat cga tat 450 
Asn Arg Tyr 
115 

caa ctt get 495 
Gin Leu Ala 
130 

cca eta gag 540 
Pro Leu Glu 

ggc act cag 585 
Gly Thr Gin 
160 

caa atg att 630 
Gin Met He 
175 

atg cgc acg 675 
Met Arg Thr 
190 

age gta att 720 
Ser Val He 
205 

att caa gag 765 
He Gin Glu 
220 

caa aga cgt 810 
Gin Arg Arg 
235 

tta ate cct 855 
Leu He Pro 
250 
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ate ata get etc atg gtg tat aga tgc gca cct cca cca teg tea 900 
He He Ala Leu Met Val Tyr Arg Cys Ala Pro Pro Pro Ser Ser 

253 260 265 

cag ttt tct ttg ctt ata agg cca gtg gta cca aat ttt aat get 945 
5 Gin Phe Ser Leu Leu He Arg Pro Val Val Pro Asn Phe Asn Ala 

270 275 280 

gat gtt tgt atg gat cct gag ccc ata gtg cgt ate gta ggt cga 990 
Asp Val Cys Met Asp Pro Glu Pro He Val Arg lie Val Gly Arg 

285 290 295 

10 aat ggt eta tgt gtt gat gtt agg gat gga aga ttc cac aac gga 1035 
Asn Gly Leu Cys Val Asp Val Arg Asp Gly Arg Phe His Asn Gly 

300 305 310 

aac gca ata cag ttg tgg cca tgc aag tct aat aca gat gca aat 1080 
Asn Ala He Gin Leu Trp Pro Cys Lys Ser Asn Thr Asp Ala Asn 
15 315 320 325 

cag etc tgg act ttg aaa aga gac aat act att cga tct aat gga 1125 
Gin Leu Trp Thr Leu Lys Arg Asp Asn Thr He Arg Ser Asn Gly 

330 335 340 

aag tgt tta act act tac ggg tac agt ccg gga gtc tat gtg atg 1170 
20 Lys Cys Leu Thr Thr Tyr Gly Tyr Ser Pro Gly Val Tyr Val Met 

345 350 355 

ate tat gat tgc aat act get gca act gat gec ace cgc tgg caa 1215 
lie Tyr Asp Cys Asn Thr Ala Ala Thr Asp Ala Thr Arg Trp Gin 

360 365 370 

25 ata tgg gat aat gga ace ate ata aat ccc aga tct agt eta gtt 1260 
He Trp Asp Asn Gly Thr lie lie Asn Pro Arg Ser Ser Leu Val 

375 380 385 

tta gca gcg aca tea ggg aac agt ggt acc aca ctt aca gtg caa 1305 
Leu Ala Ala Thr Ser Gly Asn Ser Gly Thr Thr Leu Thr Val Gin 
30 390 395 400 

acc aac att tat gee gtt agt caa ggt tgg ctt cct act aat aat 1350 
Thr Asn lie Tyr Ala Val Ser Gin Gly Trp Leu Pro Thr Asn Asn 

405 410 415 

aca caa cct ttt gtg aca acc- att gtt ggg eta tat ggt ctg tgc 1395 
35 Thr Gin Pro Phe Val Thr Thr lie Val Gly Leu Tyr Gly Leu Cys 

420 425 430 
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ttg caa 
Leu Gin 


gca 
Ala 


aat 
Asn 


agt 
Ser 
435 


gga 
Gly 


caa 
Gin 


gta 
Val 


tgg 
Trp 


ata 
He 
440 


gag 
Glu 


gac 
Asp 


tgt 
Cys 


age 
Ser 


agt 
Ser 
445 


5 


gaa aag 
Glu Lys 


get 
Ala 


gaa 
Glu 


caa 
Gin 
450 


cag 
Gin 


tgg 
Trp 


get 
Ala 


ctt 
Leu 


tat 
Tyr 
455 


gca 
Ala 


gat 
Asp 


ggt 
Gly 


tea 
Ser 


ata 
He 
460 




cgt cct 
Arg Pro 


cag 
Gin 


caa 
Gin 


aae 
Asn 
465 


cga 
Arg 


gat 
Asp 


aat 
Asn 


tgc 
Cys 


ctt 
Leu 
470 


aca 
Thr 


agt 
Ser 


gat 
Asp 


tct 
Ser 


aat 
Asn 
475 


10 


ata egg 
He Arg 


gaa 
Glu 


aca 
Thr 


gtt 
Val 
480 


gtc 
Val 


aag 
Lys 


ate 
He 


etc 
Leu 


tct 
Ser 
485 


tgt 
Cys 


ggc 
Gly 


cct 
Pro 


gca 
Ala 


tec 
Ser 
490 


lb 


tct ggc 
Ser Gly 


caa 
Gin 


cga 
Arg 


tgg 
Trp 


atg 
Met 


ttc 
Phe 


aag 
Lys 


aat 
Asn 


gat 
Asp 
500 


gga ace 
Gly Thr 


att 
He 


tta 
Leu 


aat 
Asn 
505 




ttg tat 
Leu Tyr 


agt 
Ser 


ggg 

Gly 


ttg 
Leu 
510 


gtg 
Val 


tta 
Leu 


gat 
Asp 


gtg 
Val 


agg 
Arg 
515 


gca 
Ala 


teg 
Ser 


gat 
Asp 


ccg 
Pro 


age 
Ser 
520 


20 


ctt aaa 
Leu Lys 


caa 
Gin 


ate 

He. 


att 
He 
525 


ctt 
Leu 


tac 
Tyr 


cct 
Pro 


etc 
Leu 


cat 
His 
530 


ggt gac 
Gly Asp 


cca 
Pro 


aac 
Asn 


caa 
Gin 
535 




ata tgg 
He Trp 


tta 
Leu 


cca 
Pro 


tta 
Leu 
540 


ttt 
Phe 


tga 


1731 














25 


<£. 1U> 


4 




























<211> 


10 


























<212> 


PRT 


























<213> 


Human 


immunodeficiency virus 














<400> 


4 


























30 


Val Ser 
1 

<210> 


Gin 
5 


Asn 


Tyr 
5 


Pro 


He 


Val 


Gin 


Asn 
10 
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30 
DNA 

Artificial Sequence 



Encodes the modified proricin linker sequence of SEQ ID 
NO: 4. 



<211> 
<212> 
<213> 
<220> 
<223> 

<400> 



gtt tct caa aac tac cca att gtt caa aat 30 
Val Ser Gin Asn Tyr Pro He Val Gin Asn 
15 10 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



6 

29 : 1 

DNA 

Artificial Sequence 



Primer for amplifying the 5* portion of the preproricin 
gene and incorporating a SacI recognition site. 



<400> 6 

ctcgagctct gaaaccggga ggaaatact 29 



<210> 


7 


<211> 


50 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer for 



~3 r s 

Mfel recognition site. 



12 



WO 01/60393 



PCT/US01/05282 



10 



<400> 


7 


ttgaacaatt gggtagtttt 


<210> 


8 


<211> 


51 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer for 



<400> 



gene, mutating the linker sequence, and incorporating a 
Mfel recognition site. 



ctacccaatt gttcaaaatt ttaatgctga tgtttgtatg g 51 





<210> 


9 




<211> 


29 


15 


<212> 


DNA 




<213> 


Artificial Sequence 




<220> 






<223> 


Primer for amplifying the 
gene and incorporating an 


20 


<400> 


9 




ctcctcgagt ttaagccatc tattttcat 29 




<210> 


10 



25 



<211> 1805 
<212> DNA 

<213> Ricinus communis 
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<400> 10 

ctcgagctct gaaaccggga ggaaatacta ttgtaatatg g atg tat 47 

Met Tyr 



gca gtg gca aca tgg ctt tgt ttt gga tec acc tea ggg tgg tct 92 
Ala Val Ala Thr Trp.Leu Cys Phe Gly Ser Thr Ser Gly Trp Ser 
-20 * -15 -10 

ttc aca tta gag gat aac aac ata ttc ccc aaa caa tac cca att 137 
Phe Thr Leu Glu Asp Asn Asn He Phe Pro Lys Gin Tyr Pro He 
-5 15 

ata aac ttt acc aca gcg ggt gec act gtg caa age tac aca aac 182 
He Asn Phe Thr Thr Ala Gly Ala Thr Val Gin Ser Tyr Thr Asn 
10 15 20 

ttt ate aga get gtt cgc ggt cgt tta aca act gga get gat gtg 227 
Phe lie Arg Ala Val Arg Gly Arg Leii Thr Thr Gly Ala Asp Val 
25 30 35 

aga cat gaa ata cca gtg ttg cca aac aga gtt ggt ttg cct ata 272 
Arg His Glu He Pro Val Leu Pro Asn Arg Val Gly Leu Pro He 
40 45 50 

aac caa egg ttt att tta gtt gaa etc tea aat cat gca gag ctt 317 
Asn Gin Arg Phe He Leu Val Glu Leu Ser Asn His Ala Glu Leu 
55 60 65 

tct gtt aca tta gcg ctg gat gtc acc aat gca tat gtg gtc ggc 362 
Ser Val Thr Leu Ala Leu Asp Val Thr Asn Ala Tyr Val Val Gly 
70 75 80 

tac cgt get gga aat age gca tat ttc ttt cat cct gac aat cag 407 
Tyr Arg Ala Gly Asn Ser Ala Tyr Phe Phe His Pro Asp Asn Gin 
85 90 95 

gaa gat gca gaa gca ate act cat ctt ttc act gat gtt caa aat 452 
Glu Asp Ala Glu Ala He Thr His Leu Phe Thr Asp Val Gin Asn 
100 105 HO 

cga tat aca ttc gee ttt ggt ggt aat tat gat aga ctt gaa caa 4 97 
Arg Tyr Thr Phe Ala Phe Gly Gly Asn Tyr Asp Arg Leu Glu Gin 
115 1*20" 125 

ctt get ggt aat ctg aga gaa aat ate gag ttg gga aat ggt cca 542 
Leu Ala Gly Asn Leu Arg Glu Asn He Glu Leu Gly Asn Gly Pro 
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130 135 140 

eta gag gag get ate tea gcg ctt tat tat tac agt act ggt ggc 5.87 

Leu Glu Glu Ala lie Ser Ala Leu Tyr Tyr Tyr Ser Thr Gly Gly 
145 150 155 

5 act cag ctt cca act ctg get cgt tec ttt ata att tgc ate caa 632 

Thr Gin Leu Pro Thr 4 Leu Ala Arg Ser Phe lie lie Cys lie Gin 
160 165 170 

atg att tea gaa gca gca aga ttc caa tat att gag gga gaa atg 677 

Met He Ser Glu Ala Ala Arg Phe Gin Tyr He Glu Gly Glu Met 
10 175 180 185 

cgc acg aga att agg tac aac egg aga tct gca cca gat cct age 722 

Arg Thr Arg He Arg Tyr Asn Arg Arg Ser Ala Pro Asp Pro Ser 
190 195 200 

gta att aca ctt gag aat agt tgg ggg aga ctt tea act gca att 767 

15 " Val lie Thr Leu Glu rfsh Ser Trp Gly Arg Leu Ser Thr Ala lie 
205 210 215 

caa gag tct aac caa gjja gee ttt get agt cca att caa ctg caa 812 

Gin Glu Ser Asn Gin Gly Ala Phe Ala Ser Pro He Gin Leu Gin 
220 225 * 230 

20 aga cgt aat ggt tec aaa ttc agt gtg tac gat gtg agt ata tta 857 

Arg Arg Asn Gly Ser Lys Phe Ser Val Tyr Asp Val Ser He Leu 
235 240 245 

ate cct ate ata get etc atg gtg tat aga tgc gca cct cca cca 902 

He Pro He He Ala Leu Met Val Tyr Arg Cys Ala Pro Pro Pro 
25 250 255 260 

teg tea cag ttt gtt tct caa aac tac cca att gtt caa aat ttt 947 

Ser Ser Gin Phe Val Ser Gin Asn Tyr Pro He Val Gin Asn Phe 
265 270 275 

aat get gat gtt tgt atg gat cct gag ccc ata gtg cgt ate gta 992 

30 Asn Ala Asp Val Cys Met Asp Pro Glu Pro He Val Arg He Val 
280 285 290 

ggt cga aat ggt eta tgt gtt gat gtt agg gat gga aga ttc cac 1037 

Gly Arg Asn Gly Leu Cys Val Asp Val Arg Asp Gly Arg Phe His 
295 300- 305 

35 aac gga aac gca ata cag ttg tgg cca tgc aag tct aat aca gat 1082 

Asn Gly Asn Ala He Gin Leu Trp Pro Cys Lys Ser Asn Thr Asp 
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310 315 320 

gca aat cag etc tgg act ttg aaa aga gac aat act att cga tct 1127 

Ala Asn Gin Leu Trp Thr Leu Lys Arg Asp Asn Thr lie Arg Ser 
325 330 335 

5 aat gga aag tgt tta act act tac ggg tac agt ccg gga gtc tat 1172 

Asn Gly Lys Cys Leu Thr Thr Tyr Gly Tyr Ser Pro Gly Val Tyr 
340 345 350 

gtg atg ate tat gat tgc aat act get gca act gat gee acc cgc 1217 

Val Met lie Tyr Asp Cys Asn Thr Ala Ala Thr Asp Ala Thr Arg 
10 355 360 365 

tgg caa ata tgg gat aat gga acc ate ata aat ccc aga tct agt 1262 

Trp Gin lie Trp Asp Asn Gly Thr lie lie Asn Pro Arg Ser Ser 
370 375 380 

eta gtt tta gca gcg aca tea ggg aac agt ggt acc aca ctt aca 1307 

15 Leu Val Leu Ala Ala TKr Ser Gly Asn Ser Gly Thr Thr Leu Thr 
385 390 395 

gtg caa acc aac att tat gee gtt agt caa ggt tgg ctt cct act 1352 

Val Gin Thr Asn lie Tyr Ala Val Ser Gin Gly Trp Leu Pro Thr 
400 405 410 

20 aat aat aca caa cct ttt gtg aca acc att gtt ggg eta tat ggt 1397 

Asn Asn Thr Gin Pro Phe Val Thr Thr lie Val Gly Leu Tyr Gly 
415 420 425 

ctg tgc ttg caa gca aat agt gga caa gta tgg ata gag gac tgt 1442 

Leu Cys Leu Gin Ala Asn Ser Gly Gin Val Trp lie Glu Asp Cys 
25 430 435 440 

age agt gaa aag get gaa caa cag tgg get ctt tat gca gat ggt 1487 

Ser Ser Glu Lys Ala Glu Gin Gin Trp Ala Leu Tyr Ala Asp Gly 
445 450 455 

tea ata cgt cct cag caa aac cga gat aat tgc ctt aca agt gat 1532 

30 Ser lie Arg Pro Gin Gin Asn Arg Asp Asn Cys Leu Thr Ser Asp 
460 465 470 

tct aat ata egg gaa aca gtt gtc aag ate etc tct tgt ggc cct 1577 

Ser Asn He Arg Glu Thr Val Val Lys He Leu Ser Cys Gly Pro 
475 480- 485 

35 gca tec tct ggc caa cga tgg atg ttc aag aat gat gga acc att 1622 

Ala Ser Ser Gly Gin Arg Trp Met Phe Lys Asn Asp Gly Thr He 
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490 495 500 

tta aat ttg tat agt ggg ttg gtg tta gat gtg agg gca teg gat 1667 
Leu Asn Leu Tyr Ser Gly Leu Val Leu Asp Val Arg Ala Ser Asp 
505 510 515 

5 ccg age ctt aaa caa ate att ctt tac cct etc cat ggt gac cca 1712 
Pro Ser Leu Lys Gin lie lie Leu Tyr Pro Leu His Gly Asp Pro 
520 525 530 

aac caa ata tgg tta cca tta ttt tgatagacag attactctct 1756 
Asn Gin lie Trp Leu Pro Leu Phe 
10 535 540 

tgcagtgtgt atgtcctgcc atgaaaatag atggcttaaa ctcgaggag 1805 

<210> 11 
<211> 10 
<212> PRT 
15 . <213> Human immunodef i.eiency virus 
<400> 11 

Ser Ala Thr He Met Met Gin Arg Gly Asn 
15 10 

<210> 12 

20 <211> 12 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 12 

Val Ser Gin Asn Tyr Pro He Val Gin Asn Phe Asn 
25 1 5 10 

<210> 13 
<211> 12 



17 



WO 01/60393 PCT/US01/05282 



<212> PRT 

<213> Human immunodeficiency virus 

<400> , 13 

Ser Ala Thr He Met Met Gin Arg Gly Asn Phe Asn 

5 1 5 10 

<210> 14 

<211> 12 

<212> PRT 

<213> Ricinus communis 

10 <400> ^ 14 

Val Ser He Leu He Pro He He Ala Leu Met Val 

15 10 

<210>. 15 

<211> 5 

15 <212> PRT 

<213> Rous sarcoma virus 

<400> 15 

Pro Pro Pro Pro Thr 

1 5 

20 <210> 16 

<211> 4 

<212> PRT 

<213> Homo sapiens 

<400> 16 

25 Lys Asp Glu Leu 
1 
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<210> 17 

<211> 5 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 17 

Gly Ala Arg Ala Ser 
1 5 



19 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/USO 1/05989 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(7) :A6 1 K 38/00, 39/40 
US CL 5 14/91; 494/ 1 83. 1 
According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 514/91; 494/183.1 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 

DIALOG. BIOSIS, EM BASE, MEDLINE, CASEARCH, toxins, rictn, rips, hiv, antiviral, activated, protease, protein synthesis 
inactivating toxin 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5,981,493 A (LAGRONE) 09 November 1999, entire document. 



1-45 



| | Further documents are listed in the continuation of Box C | | See patent family annex. 



"A* 

•L" 

"0- 
"P" 



8*adal mliftubw rfoitea aocwutc 

feeuMaft iafiaiaj Um pm*nl state of Um art wbkk k ■<* conaiiiraa 
to ha at taitJMU* talmaea 

tartfer t nw nt aahhihal am a* after Um iakmattoaal filin« late 



later tsenuit r^lfcfcaa after Um iateraatfcnalfilin* iato m fxtocttr 
fate an* met, U eeaflfat with Um Kgrnikutim hat citea to uaaataaa 
um pueifb m iU*rj «»aarlnM Um tavmtioa 



ililMil laiaattoa «au<* I 
la lavalia aa lamtiTC at 



at vafaa suf tknv aoaht mi priarirf eliia(i) « vafea a 
citea to -U*lkh Um fahUeatfea <* m«umt oiutte* or otter 
*M»ial UN (a. wrmMmi) 

aaeuMat rafrrlaf to aa oral cteokaara, aw, cihftitlnfi ax other 



aooaaart pAlsutoi fxfe* to Um iateraatioaal CUn. late tat liter 
Una Um friariir Sate claiiaaa 



aoovauat of f ralmaea; 
ofriaraa aeaal ar uuot ha 
whn Um inn— it to tokaa aloa* 



aaeajDMat of sutfemlax ralmaaa; UM aliiaaH lsvaattaa taaatt ha 
ooaxfcaaraa to laTorrc aa iomtrM ate) vhaa Um iooannt k MMitoil 
▼Uh aaa «r toes* eUtot aaeh iocajaaate, noh aotobtoattMi hauuj 
afcrioaa to a vaaoai driuaa. ia tha art 



•sr 



r of thai 



» jateal faaiirr 



Date of the actual completion of the international search 
16 JULY 8001 



Date of mailing of the international search report 




26 JUL 2001 



T. PARK, PRD. (J 



Name and mailing address of the ISA/US 
Commas ion er of Patents and Trade majfcs 

Box PCT 

Washingtem, D C 20331 
Facsimile No. (705) 305-9950 



Authorized ofTtcer 

LNKYEL 
fephone No. (705) 505-0196 



Form PCT/ISA/fllO (second sheet) (July 1998)* 



